A Language Model based Framework for New Concept Placement in Ontologies
Hang Dong, Jiaoyan Chen, Yuan He, Yongsheng Gao, Ian Horrocks

TL;DR
This paper presents a neural network-based framework for inserting new concepts into ontologies, utilizing PLMs and LLMs for candidate search, edge formation, and selection, with evaluations on biomedical datasets.
Contribution
It introduces a multi-step approach leveraging neural methods and contrastive learning for ontology extension, including explainable instruction tuning for LLMs.
Findings
Fine-tuned PLMs excel in candidate search.
Multi-label Cross-encoder improves edge selection.
Explainable instruction tuning enhances LLM performance.
Abstract
We investigate the task of inserting new concepts extracted from texts into an ontology using language models. We explore an approach with three steps: edge search which is to find a set of candidate locations to insert (i.e., subsumptions between concepts), edge formation and enrichment which leverages the ontological structure to produce and enhance the edge candidates, and edge selection which eventually locates the edge to be placed into. In all steps, we propose to leverage neural methods, where we apply embedding-based methods and contrastive learning with Pre-trained Language Models (PLMs) such as BERT for edge search, and adapt a BERT fine-tuning-based multi-label Edge-Cross-encoder, and Large Language Models (LLMs) such as GPT series, FLAN-T5, and Llama 2, for edge selection. We evaluate the methods on recent datasets created using the SNOMED CT ontology and the MedMentions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Service-Oriented Architecture and Web Services · Web Data Mining and Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Sparse Evolutionary Training · Cosine Annealing · Linear Layer · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout
