Few-shot clinical entity recognition in English, French and Spanish:   masked language models outperform generative model prompting

Marco Naguib; Xavier Tannier; Aur\'elie N\'ev\'eol

arXiv:2402.12801·cs.CL·October 10, 2024·1 cites

Few-shot clinical entity recognition in English, French and Spanish: masked language models outperform generative model prompting

Marco Naguib, Xavier Tannier, Aur\'elie N\'ev\'eol

PDF

Open Access

TL;DR

This study evaluates the performance of various large language models for few-shot clinical named entity recognition across English, French, and Spanish, finding masked models outperform generative models in clinical settings.

Contribution

It provides a comprehensive comparison of auto-regressive and masked LLMs for clinical NER in multiple languages, highlighting the superiority of masked models in this domain.

Findings

01

Masked models outperform auto-regressive models in clinical NER.

02

Prompt-based auto-regressive models are competitive in general NER.

03

Masked models have lower environmental impact.

Abstract

Large language models (LLMs) have become the preferred solution for many natural language processing tasks. In low-resource environments such as specialized domains, their few-shot capabilities are expected to deliver high performance. Named Entity Recognition (NER) is a critical task in information extraction that is not covered in recent LLM benchmarks. There is a need for better understanding the performance of LLMs for NER in a variety of settings including languages other than English. This study aims to evaluate generative LLMs, employed through prompt engineering, for few-shot clinical NER. %from the perspective of F1 performance and environmental impact. We compare 13 auto-regressive models using prompting and 16 masked models using fine-tuning on 14 NER datasets covering English, French and Spanish. While prompt-based auto-regressive models achieve competitive F1 for general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques