MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality
Kyungwon Kim, Dosik Hwang

TL;DR
MUST is a novel transformer-based framework that explicitly models modality-specific and shared information in multimodal medical data to improve survival prediction, especially with missing modalities.
Contribution
It introduces a modality decomposition approach with algebraic constraints and uses diffusion models for missing modality generation, advancing survival prediction methods.
Findings
Achieves state-of-the-art results on five TCGA datasets.
Maintains robust predictions with missing modalities.
Operates with clinically acceptable inference latency.
Abstract
Accurate survival prediction from multimodal medical data is essential for precision oncology, yet clinical deployment faces a persistent challenge: modalities are frequently incomplete due to cost constraints, technical limitations, or retrospective data availability. While recent methods attempt to address missing modalities through feature alignment or joint distribution learning, they fundamentally lack explicit modeling of the unique contributions of each modality as opposed to the information derivable from other modalities. We propose MUST (Modality-Specific representation-aware Transformer), a novel framework that explicitly decomposes each modality's representation into modality-specific and cross-modal contextualized components through algebraic constraints in a learned low-rank shared subspace. This decomposition enables precise identification of what information is lost when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
