Conditional Evidence Reconstruction and Decomposition for Interpretable Multimodal Diagnosis

Shaowen Wan; Yanjun Lv; Lu Zhang; Dajiang Zhu; Bharat Biswal; Tianming Liu; Xiaobo Li; and Lin Zhao

arXiv:2604.17030·cs.CV·April 21, 2026

Conditional Evidence Reconstruction and Decomposition for Interpretable Multimodal Diagnosis

Shaowen Wan, Yanjun Lv, Lu Zhang, Dajiang Zhu, Bharat Biswal, Tianming Liu, Xiaobo Li, and Lin Zhao

PDF

TL;DR

This paper introduces CERD, a novel framework for interpretable multimodal diagnosis that reconstructs missing data and decomposes evidence, improving robustness and interpretability in incomplete modality scenarios.

Contribution

CERD is the first method to reconstruct missing modalities conditioned on observed data and decompose evidence into shared and modality-specific cues for better interpretability.

Findings

01

CERD outperforms baselines in incomplete-modality settings on ADNI data.

02

CERD provides structured, clinically aligned evidence attributions.

03

CERD enhances trustworthiness of multimodal diagnosis models.

Abstract

Neurobiological and neurodegenerative diseases are inherently multifactorial, arising from coupled influences spanning genetic susceptibility, brain alterations, and environmental and behavioral factors. Multimodal modeling has therefore been increasingly adopted for disease diagnosis by integrating complementary evidence across data sources. However, in both large-scale cohorts and real-world clinical workflows, modality coverage is often incomplete, making many multimodal models brittle when one or more modalities are unavailable. Existing approaches to incomplete multimodal diagnosis typically rely on group-wise or static priors, which may fail to capture subject-specific cross-modal dependencies; moreover, many models provide limited interpretability into which evidence sources drive the final decision. To address these limitations, we propose Conditional Evidence Reconstruction and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.