MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality

Kyungwon Kim; Dosik Hwang

arXiv:2603.26071·cs.CV·March 30, 2026

MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality

Kyungwon Kim, Dosik Hwang

PDF

TL;DR

MUST is a novel transformer-based framework that explicitly models modality-specific and shared information in multimodal medical data to improve survival prediction, especially with missing modalities.

Contribution

It introduces a modality decomposition approach with algebraic constraints and uses diffusion models for missing modality generation, advancing survival prediction methods.

Findings

01

Achieves state-of-the-art results on five TCGA datasets.

02

Maintains robust predictions with missing modalities.

03

Operates with clinically acceptable inference latency.

Abstract

Accurate survival prediction from multimodal medical data is essential for precision oncology, yet clinical deployment faces a persistent challenge: modalities are frequently incomplete due to cost constraints, technical limitations, or retrospective data availability. While recent methods attempt to address missing modalities through feature alignment or joint distribution learning, they fundamentally lack explicit modeling of the unique contributions of each modality as opposed to the information derivable from other modalities. We propose MUST (Modality-Specific representation-aware Transformer), a novel framework that explicitly decomposes each modality's representation into modality-specific and cross-modal contextualized components through algebraic constraints in a learned low-rank shared subspace. This decomposition enables precise identification of what information is lost when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.