Clinical information extraction for Low-resource languages with Few-shot   learning using Pre-trained language models and Prompting

Phillip Richter-Pechanski; Philipp Wiesenbach; Dominic M. Schwab,; Christina Kiriakou; Nicolas Geis; Christoph Dieterich; Anette Frank

arXiv:2403.13369·cs.CL·December 18, 2024·1 cites

Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting

Phillip Richter-Pechanski, Philipp Wiesenbach, Dominic M. Schwab,, Christina Kiriakou, Nicolas Geis, Christoph Dieterich, Anette Frank

PDF

Open Access

TL;DR

This paper evaluates prompt-based few-shot learning with lightweight models for clinical information extraction in low-resource languages, demonstrating significant accuracy improvements and emphasizing interpretability.

Contribution

It provides the first systematic evaluation of prompt-based few-shot learning for clinical text classification in low-resource settings, with detailed interpretability analysis.

Findings

01

Prompted models with 20 shots outperform traditional models by 30.5% accuracy.

02

Lightweight, domain-adapted pretrained models are effective for low-resource clinical NLP.

03

Shapley values validate interpretability and quality of small training datasets.

Abstract

Automatic extraction of medical information from clinical documents poses several challenges: high costs of required clinical expertise, limited interpretability of model predictions, restricted computational resources and privacy regulations. Recent advances in domain-adaptation and prompting methods showed promising results with minimal training data using lightweight masked language models, which are suited for well-established interpretability methods. We are first to present a systematic evaluation of these methods in a low-resource setting, by performing multi-class section classification on German doctor's letters. We conduct extensive class-wise evaluations supported by Shapley values, to validate the quality of our small training data set and to ensure the interpretability of model predictions. We demonstrate that a lightweight, domain-adapted pretrained model, prompted with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsSparse Evolutionary Training