MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs
Hsin-Ling Hsu, Zizheng Wang, Donghua Zhang, Nai-Chia Chen, Jerry Wang, Jun-En Ding, Chia-Hsuan Hsu, Guoan Wang, Feng Liu, Fang-Ming Hung, Chenwei Wu, Liyue Shen

TL;DR
This paper introduces MedAction, a pipeline for training medical language models on multi-turn diagnostic trajectories, improving their ability to reason and act under evolving clinical evidence.
Contribution
We develop a novel distillation pipeline and dataset for active multi-turn clinical diagnosis, addressing current models' failure modes and enhancing open-source medical LLM performance.
Findings
Fine-tuning on MedAction-32K improves diagnostic reasoning accuracy.
MedAction-8B achieves state-of-the-art results on MedR-Bench.
Proposed metrics effectively filter high-quality diagnostic trajectories.
Abstract
Most existing LLM diagnoses are evaluated on static, single-turn settings where complete patient information is provided upfront, an oversimplification of real clinical practice. We study active diagnosis: the real-life clinical process of starting from initial observation, ordering tests, interpreting results, and updating a differential diagnosis across multiple turns. Through systematic analysis, we identify three recurring failure modes in current LLMs: ungrounded test ordering, unreliable diagnostic update, and degraded multi-turn coherence. Together, these failures reveal a core deficit: existing medical training data teaches models to reason from complete information but not to act under evolving, partial evidence. To address this gap, we introduce MedAction, a tree-structured distillation pipeline that synthesizes diverse and high-quality multi-turn diagnostic trajectories via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
