A Language Model based Framework for New Concept Placement in Ontologies

Hang Dong; Jiaoyan Chen; Yuan He; Yongsheng Gao; Ian Horrocks

arXiv:2402.17897·cs.CL·March 5, 2024·1 cites

A Language Model based Framework for New Concept Placement in Ontologies

Hang Dong, Jiaoyan Chen, Yuan He, Yongsheng Gao, Ian Horrocks

PDF

Open Access 1 Repo

TL;DR

This paper presents a neural network-based framework for inserting new concepts into ontologies, utilizing PLMs and LLMs for candidate search, edge formation, and selection, with evaluations on biomedical datasets.

Contribution

It introduces a multi-step approach leveraging neural methods and contrastive learning for ontology extension, including explainable instruction tuning for LLMs.

Findings

01

Fine-tuned PLMs excel in candidate search.

02

Multi-label Cross-encoder improves edge selection.

03

Explainable instruction tuning enhances LLM performance.

Abstract

We investigate the task of inserting new concepts extracted from texts into an ontology using language models. We explore an approach with three steps: edge search which is to find a set of candidate locations to insert (i.e., subsumptions between concepts), edge formation and enrichment which leverages the ontological structure to produce and enhance the edge candidates, and edge selection which eventually locates the edge to be placed into. In all steps, we propose to leverage neural methods, where we apply embedding-based methods and contrastive learning with Pre-trained Language Models (PLMs) such as BERT for edge search, and adapt a BERT fine-tuning-based multi-label Edge-Cross-encoder, and Large Language Models (LLMs) such as GPT series, FLAN-T5, and Llama 2, for edge selection. We evaluate the methods on recent datasets created using the SNOMED CT ontology and the MedMentions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

krr-oxford/lm-ontology-concept-placement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Service-Oriented Architecture and Web Services · Web Data Mining and Analysis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Sparse Evolutionary Training · Cosine Annealing · Linear Layer · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout