Few-shot clinical entity recognition in English, French and Spanish: masked language models outperform generative model prompting
Marco Naguib, Xavier Tannier, Aur\'elie N\'ev\'eol

TL;DR
This study evaluates the performance of various large language models for few-shot clinical named entity recognition across English, French, and Spanish, finding masked models outperform generative models in clinical settings.
Contribution
It provides a comprehensive comparison of auto-regressive and masked LLMs for clinical NER in multiple languages, highlighting the superiority of masked models in this domain.
Findings
Masked models outperform auto-regressive models in clinical NER.
Prompt-based auto-regressive models are competitive in general NER.
Masked models have lower environmental impact.
Abstract
Large language models (LLMs) have become the preferred solution for many natural language processing tasks. In low-resource environments such as specialized domains, their few-shot capabilities are expected to deliver high performance. Named Entity Recognition (NER) is a critical task in information extraction that is not covered in recent LLM benchmarks. There is a need for better understanding the performance of LLMs for NER in a variety of settings including languages other than English. This study aims to evaluate generative LLMs, employed through prompt engineering, for few-shot clinical NER. %from the perspective of F1 performance and environmental impact. We compare 13 auto-regressive models using prompting and 16 masked models using fine-tuning on 14 NER datasets covering English, French and Spanish. While prompt-based auto-regressive models achieve competitive F1 for general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
