Large Language Models Struggle in Token-Level Clinical Named Entity Recognition
Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu

TL;DR
This paper investigates the effectiveness of large language models in token-level clinical named entity recognition, highlighting challenges and potential improvements for healthcare applications, especially in rare disease contexts.
Contribution
It is the first comprehensive study comparing proprietary and local LLMs for token-level clinical NER using various prompting and fine-tuning methods.
Findings
LLMs face significant challenges in token-level clinical NER.
Prompting and fine-tuning can improve LLM performance in this task.
Local open-source LLMs show potential but still lag behind proprietary models.
Abstract
Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPT for token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Healthcare
