Utilizing Large Language Models for Zero-Shot Medical Ontology Extension from Clinical Notes
Guanchen Wu, Yuzhang Xie, Huanwei Wu, Zhe He, Hui Shao, Xiao Hu, Carl Yang

TL;DR
This paper introduces CLOZE, a zero-shot framework utilizing large language models to automatically extract and integrate new medical concepts from clinical notes into existing ontologies, enhancing biomedical data coverage without additional training.
Contribution
CLOZE is a novel zero-shot approach that leverages LLMs for automatic, scalable, and privacy-preserving ontology extension from unstructured clinical notes, without requiring labeled data.
Findings
CLOZE accurately identifies disease concepts from clinical notes.
The framework effectively captures hierarchical relationships in medical ontologies.
CLOZE demonstrates scalability and privacy preservation in ontology extension.
Abstract
Integrating novel medical concepts and relationships into existing ontologies can significantly enhance their coverage and utility for both biomedical research and clinical applications. Clinical notes, as unstructured documents rich with detailed patient observations, offer valuable context-specific insights and represent a promising yet underutilized source for ontology extension. Despite this potential, directly leveraging clinical notes for ontology extension remains largely unexplored. To address this gap, we propose CLOZE, a novel framework that uses large language models (LLMs) to automatically extract medical entities from clinical notes and integrate them into hierarchical medical ontologies. By capitalizing on the strong language understanding and extensive biomedical knowledge of pre-trained LLMs, CLOZE effectively identifies disease-related concepts and captures complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Biomedical Text Mining and Ontologies · Topic Modeling
