From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning

Chen Zhan; Xiaoyu Tan; Gengchen Ma; Yu-Jie Xiong; Xiaoyan Jiang; Xihe Qiu

arXiv:2604.11137·cs.AI·April 21, 2026

From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning

Chen Zhan, Xiaoyu Tan, Gengchen Ma, Yu-Jie Xiong, Xiaoyan Jiang, Xihe Qiu

PDF

TL;DR

This paper introduces a Toulmin-guided curriculum learning approach to improve the transparency and reliability of clinical diagnostic reasoning in large language models.

Contribution

It adapts the Toulmin model for clinical argumentation and proposes a novel curriculum training pipeline, CGCL, to enhance diagnostic reasoning in LLMs.

Findings

01

CGCL achieves diagnostic accuracy comparable to reinforcement learning methods.

02

The approach produces more transparent and structured clinical arguments.

03

Training with CGCL is more stable and efficient than resource-intensive methods.

Abstract

The integration of Large Language Models (LLMs) into clinical decision support is critically obstructed by their opaque and often unreliable reasoning. In the high-stakes domain of healthcare, correct answers alone are insufficient; clinical practice demands full transparency to ensure patient safety and enable professional accountability. A pervasive and dangerous weakness of current LLMs is their tendency to produce "correct answers through flawed reasoning." This issue is far more than a minor academic flaw; such process errors signal a fundamental lack of robust understanding, making the model prone to broader hallucinations and unpredictable failures when faced with real-world clinical complexity. In this paper, we establish a framework for trustworthy clinical argumentation by adapting the Toulmin model to the diagnostic process. We propose a novel training pipeline: Curriculum…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.