MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models

Jianxin Lin; Chunzheng Zhu; Peter J. Kneuertz; Yunfei Bai; Yuan Xue

arXiv:2603.23085·cs.AI·March 25, 2026

Open Access

TL;DR

MedCausalX introduces an adaptive causal reasoning framework for medical vision-language models, enhancing diagnostic accuracy and reliability by explicitly modeling causal chains and verifying reasoning through a novel two-stage reflection architecture.

Contribution

The paper presents MedCausalX, a novel end-to-end framework that explicitly models causal reasoning in medical VLMs. It comprises a new dataset, a two-stage reflection architecture, and a trajectory-level causal correction method.

Findings

01. Outperforms state-of-the-art methods in medical diagnosis tasks.
02. Improves diagnostic consistency by +5.4 points.
03. Reduces hallucination in model outputs by over 10 points.

Abstract

Vision-Language Models (VLMs) have enabled interpretable medical diagnosis by integrating visual perception with linguistic reasoning. Yet existing medical chain-of-thought (CoT) models lack explicit mechanisms to represent and enforce causal reasoning, leaving them vulnerable to spurious correlations and limiting their clinical reliability. We pinpoint three core challenges in medical CoT reasoning: how to adaptively trigger causal correction, how to construct high-quality causal-spurious contrastive samples, and how to maintain causal consistency across reasoning trajectories. To address these challenges, we propose MedCausalX, an end-to-end framework that explicitly models causal reasoning chains in medical VLMs. We first introduce the CRMed dataset, which provides fine-grained anatomical annotations, structured causal reasoning chains, and counterfactual variants that guide the learning of causal…

Taxonomy

Topics: Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning