TL;DR
PromptLink is a novel framework that leverages large language models to improve biomedical concept linking across diverse data sources by generating candidate concepts and using a two-stage prompting process for enhanced reliability.
Contribution
It introduces a generic, knowledge-agnostic framework utilizing LLMs for biomedical concept linking, overcoming limitations of prior rule-based and machine learning methods.
Findings
Effective on EHR datasets and biomedical knowledge graphs
No reliance on additional prior knowledge or training data
Demonstrates strong zero-shot prediction capabilities
Abstract
Linking (aligning) biomedical concepts across diverse data sources enables various integrative analyses, but it is challenging due to the discrepancies in concept naming conventions. Various strategies have been developed to overcome this challenge, such as those based on string-matching rules, manually crafted thesauri, and machine learning models. However, these methods are constrained by limited prior biomedical knowledge and can hardly generalize beyond the limited amounts of rules, thesauri, or training samples. Recently, large language models (LLMs) have exhibited impressive results in diverse biomedical NLP tasks due to their unprecedentedly rich prior knowledge and strong zero-shot prediction abilities. However, LLMs suffer from issues including high costs, limited context length, and unreliable predictions. In this research, we propose PromptLink, a novel biomedical concept…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
