Large Language Models Struggle in Token-Level Clinical Named Entity   Recognition

Qiuhao Lu; Rui Li; Andrew Wen; Jinlian Wang; Liwei Wang; Hongfang Liu

arXiv:2407.00731·cs.CL·August 20, 2024·6 cites

Large Language Models Struggle in Token-Level Clinical Named Entity Recognition

Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effectiveness of large language models in token-level clinical named entity recognition, highlighting challenges and potential improvements for healthcare applications, especially in rare disease contexts.

Contribution

It is the first comprehensive study comparing proprietary and local LLMs for token-level clinical NER using various prompting and fine-tuning methods.

Findings

01

LLMs face significant challenges in token-level clinical NER.

02

Prompting and fine-tuning can improve LLM performance in this task.

03

Local open-source LLMs show potential but still lag behind proprietary models.

Abstract

Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPT for token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qiuhaolu/tner
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Healthcare