OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models
Hui Feng, Yuntzu Yin, Emiliano Reynares, Jay Nanavati

TL;DR
OntologyRAG enhances biomedical code mapping by integrating ontological knowledge graphs with large language models, enabling faster, more accurate, and interpretable mappings without retraining LLMs.
Contribution
We introduce OntologyRAG, a retrieval-augmented generation approach that leverages ontological knowledge graphs for improved biomedical code mapping without re-training language models.
Findings
Improved code mapping accuracy on a curated dataset.
Faster mapping process for domain experts.
No need for re-training LLMs with ontology updates.
Abstract
Biomedical ontologies, which comprehensively define concepts and relations for biomedical entities, are crucial for structuring and formalizing domain-specific information representations. Biomedical code mapping identifies similarity or equivalence between concepts from different ontologies. Obtaining high-quality mapping usually relies on automatic generation of unrefined mapping with ontology domain fine-tuned language models (LMs), followed by manual selections or corrections by coding experts who have extensive domain expertise and familiarity with ontology schemas. The LMs usually provide unrefined code mapping suggestions as a list of candidates without reasoning or supporting evidence, hence coding experts still need to verify each suggested candidate against ontology sources to pick the best matches. This is also a recurring task as ontology sources are updated regularly to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Biomedical Text Mining and Ontologies · Topic Modeling
MethodsSparse Evolutionary Training · Ontology
