Self-Prompting Small Language Models for Privacy-Sensitive Clinical Information Extraction

Yao-Shun Chuang; Tushti Mody; Uday Pratap Singh; Shirindokht Shiraz; Chun-Teh Lee; Ryan Brandon; Muhammad F Walji; Xiaoqian Jiang; Bunmi Tokede

arXiv:2605.04221·cs.CL·May 7, 2026

Self-Prompting Small Language Models for Privacy-Sensitive Clinical Information Extraction

Yao-Shun Chuang, Tushti Mody, Uday Pratap Singh, Shirindokht Shiraz, Chun-Teh Lee, Ryan Brandon, Muhammad F Walji, Xiaoqian Jiang, Bunmi Tokede

PDF

TL;DR

This paper presents a framework enabling small language models to self-generate and optimize prompts for extracting clinical entities from dental notes, improving privacy-sensitive clinical information extraction.

Contribution

It introduces a locally deployable, self-prompting approach with fine-tuning and preference optimization for small models in clinical NLP tasks.

Findings

01

Qwen2.5-14B-Instruct achieved the highest baseline performance.

02

Post-training with DPO improved model F1 scores significantly.

03

Task-specific evaluation revealed substantial performance variation among models.

Abstract

Clinical named entity recognition from dental progress notes is challenging because documentation is highly unstructured, domain-specific, and often privacy-sensitive. We developed a locally deployable framework that enables small language models to self-generate, verify, refine, and evaluate entity-specific prompts for extracting multiple clinical entities from dental notes. Using 1,200 annotated notes, we evaluated candidate open-weight models with multi-prompt ensemble inference and further adapted selected models using QLoRA-based supervised fine-tuning and direct preference optimization. Model performance varied substantially, highlighting the need for task-specific evaluation rather than reliance on generic benchmarks. Qwen2.5-14B-Instruct achieved the strongest baseline performance. After DPO, Qwen2.5-14B-Instruct and Llama-3.1-8B-Instruct achieved micro/macro F1 scores of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.