BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights
Fran\c{c}ois Remy, Kris Demuynck, Thomas Demeester

TL;DR
BioLORD-2023 introduces a novel biomedical semantic model that combines Large Language Models with clinical knowledge graphs, achieving state-of-the-art results and supporting multiple languages for diverse biomedical NLP tasks.
Contribution
We propose a new approach integrating LLMs and knowledge graphs with contrastive learning, self-distillation, and weight averaging, resulting in superior biomedical semantic representations.
Findings
Achieved +2 points on MedSTS
Achieved +2.5 points on MedNLI-S
Achieved +6.1 points on EHR-Rel-B
Abstract
In this study, we investigate the potential of Large Language Models to complement biomedical knowledge graphs in the training of semantic models for the biomedical and clinical domains. Drawing on the wealth of the UMLS knowledge graph and harnessing cutting-edge Large Language Models, we propose a new state-of-the-art approach for obtaining high-fidelity representations of biomedical concepts and sentences, consisting of three steps: an improved contrastive learning phase, a novel self-distillation phase, and a weight averaging phase. Through rigorous evaluations via the extensive BioLORD testing suite and diverse downstream tasks, we demonstrate consistent and substantial performance improvements over the previous state of the art (e.g. +2pts on MedSTS, +2.5pts on MedNLI-S, +6.1pts on EHR-Rel-B). Besides our new state-of-the-art biomedical model for English, we also distill and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗FremyCompany/BioLORD-2023-Mmodel· 22k dl· ♡ 1922k dl♡ 19
- 🤗FremyCompany/BioLORD-2023model· 24k dl· ♡ 5124k dl♡ 51
- 🤗FremyCompany/BioLORD-2023-Cmodel· 47k dl· ♡ 747k dl♡ 7
- 🤗FremyCompany/BioLORD-2023-Smodel· 261 dl· ♡ 2261 dl♡ 2
- 🤗FremyCompany/BioLORD-2023-M-Dutch-InContext-v1model· 15k dl· ♡ 415k dl♡ 4
- 🤗UMCU/BioLORD-2023-M-Dutch-InContext-v1-ST_bf16model· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Artificial Intelligence in Healthcare and Education
MethodsContrastive Learning
