Governance-Ready Small Language Models for Medical Imaging: Prompting, Abstention, and PACS Integration
Yiting Wang, Ziwei Wang, Di Zhu, Jiachen Zhong, Weiyi Li

TL;DR
This paper introduces a comprehensive framework for deploying small language models in medical imaging workflows, emphasizing governance, calibration, and integration with clinical standards to enable safe and effective utility.
Contribution
It presents a prompt-first deployment framework with calibration, oversight, and integration strategies tailored for medical imaging applications using small language models.
Findings
Reflection-oriented prompts improve lighter models' performance
Operational calibration reduces expected errors and oversight burden
Outputs are mapped to DICOM, HL7, and FHIR standards for integration
Abstract
Small Language Models (SLMs) are a practical option for narrow, workflow-relevant medical imaging utilities where privacy, latency, and cost dominate. We present a governance-ready recipe that combines prompt scaffolds, calibrated abstention, and standards-compliant integration into Picture Archiving and Communication Systems (PACS). Our focus is the assistive task of AP/PA view tagging for chest radiographs. Using four deployable SLMs (Qwen2.5-VL, MiniCPM-V, Gemma 7B, LLaVA 7B) on NIH Chest X-ray, we provide illustrative evidence: reflection-oriented prompts benefit lighter models, whereas stronger baselines are less sensitive. Beyond accuracy, we operationalize abstention, expected calibration error, and oversight burden, and we map outputs to DICOM tags, HL7 v2 messages, and FHIR ImagingStudy. The contribution is a prompt-first deployment framework, an operations playbook for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
