MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs

Hsin-Ling Hsu; Zizheng Wang; Donghua Zhang; Nai-Chia Chen; Jerry Wang; Jun-En Ding; Chia-Hsuan Hsu; Guoan Wang; Feng Liu; Fang-Ming Hung; Chenwei Wu; Liyue Shen

arXiv:2605.07305·cs.CL·May 11, 2026

MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs

Hsin-Ling Hsu, Zizheng Wang, Donghua Zhang, Nai-Chia Chen, Jerry Wang, Jun-En Ding, Chia-Hsuan Hsu, Guoan Wang, Feng Liu, Fang-Ming Hung, Chenwei Wu, Liyue Shen

PDF

TL;DR

This paper introduces MedAction, a pipeline for training medical language models on multi-turn diagnostic trajectories, improving their ability to reason and act under evolving clinical evidence.

Contribution

We develop a novel distillation pipeline and dataset for active multi-turn clinical diagnosis, addressing current models' failure modes and enhancing open-source medical LLM performance.

Findings

01

Fine-tuning on MedAction-32K improves diagnostic reasoning accuracy.

02

MedAction-8B achieves state-of-the-art results on MedR-Bench.

03

Proposed metrics effectively filter high-quality diagnostic trajectories.

Abstract

Most existing LLM diagnoses are evaluated on static, single-turn settings where complete patient information is provided upfront, an oversimplification of real clinical practice. We study active diagnosis: the real-life clinical process of starting from initial observation, ordering tests, interpreting results, and updating a differential diagnosis across multiple turns. Through systematic analysis, we identify three recurring failure modes in current LLMs: ungrounded test ordering, unreliable diagnostic update, and degraded multi-turn coherence. Together, these failures reveal a core deficit: existing medical training data teaches models to reason from complete information but not to act under evolving, partial evidence. To address this gap, we introduce MedAction, a tree-structured distillation pipeline that synthesizes diverse and high-quality multi-turn diagnostic trajectories via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.