MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models
Jianxin Lin, Chunzheng Zhu, Peter J. Kneuertz, Yunfei Bai, Yuan Xue

TL;DR
MedCausalX introduces an adaptive causal reasoning framework for medical vision-language models, enhancing diagnostic accuracy and reliability by explicitly modeling causal chains and verifying reasoning through a novel two-stage reflection architecture.
Contribution
The paper presents MedCausalX, a novel end-to-end framework that explicitly models causal reasoning in medical VLMs, including a new dataset, a two-stage reflection architecture, and a trajectory-level causal correction method.
Findings
Outperforms state-of-the-art methods in medical diagnosis tasks.
Improves diagnostic consistency by +5.4 points.
Reduces hallucination in model outputs by over 10 points.
Abstract
Vision-Language Models (VLMs) have enabled interpretable medical diagnosis by integrating visual perception with linguistic reasoning. Yet, existing medical chain-of-thought (CoT) models lack explicit mechanisms to represent and enforce causal reasoning, leaving them vulnerable to spurious correlations and limiting their clinical reliability. We pinpoint three core challenges in medical CoT reasoning: how to adaptively trigger causal correction, construct high-quality causal-spurious contrastive samples, and maintain causal consistency across reasoning trajectories. To address these challenges, we propose MedCausalX, an end-to-end framework explicitly models causal reasoning chains in medical VLMs. We first introduce the CRMed dataset providing fine-grained anatomical annotations, structured causal reasoning chains, and counterfactual variants that guide the learning of causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
