PromptLink: Leveraging Large Language Models for Cross-Source Biomedical   Concept Linking

Yuzhang Xie; Jiaying Lu; Joyce Ho; Fadi Nahab; Xiao Hu; Carl Yang

arXiv:2405.07500·cs.IR·May 14, 2024

PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking

Yuzhang Xie, Jiaying Lu, Joyce Ho, Fadi Nahab, Xiao Hu, Carl Yang

PDF

1 Repo

TL;DR

PromptLink is a novel framework that leverages large language models to improve biomedical concept linking across diverse data sources by generating candidate concepts and using a two-stage prompting process for enhanced reliability.

Contribution

It introduces a generic, knowledge-agnostic framework utilizing LLMs for biomedical concept linking, overcoming limitations of prior rule-based and machine learning methods.

Findings

01

Effective on EHR datasets and biomedical knowledge graphs

02

No reliance on additional prior knowledge or training data

03

Demonstrates strong zero-shot prediction capabilities

Abstract

Linking (aligning) biomedical concepts across diverse data sources enables various integrative analyses, but it is challenging due to the discrepancies in concept naming conventions. Various strategies have been developed to overcome this challenge, such as those based on string-matching rules, manually crafted thesauri, and machine learning models. However, these methods are constrained by limited prior biomedical knowledge and can hardly generalize beyond the limited amounts of rules, thesauri, or training samples. Recently, large language models (LLMs) have exhibited impressive results in diverse biomedical NLP tasks due to their unprecedentedly rich prior knowledge and strong zero-shot prediction abilities. However, LLMs suffer from issues including high costs, limited context length, and unreliable predictions. In this research, we propose PromptLink, a novel biomedical concept…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

constantjxyz/promptlink
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.