Multimodal Cross-Task Interaction for Survival Analysis in Whole Slide Pathological Images
Songhan Jiang, Zhengyu Gan, Linghan Cai, Yifeng Wang, Yongbing Zhang

TL;DR
This paper introduces a novel multimodal framework that enhances survival analysis in cancer by effectively integrating pathological images and genomic data through cross-task interactions and optimal transport-based attention.
Contribution
The proposed MCTI framework models the correlation between subtype classification and survival analysis tasks, integrating multimodal data while aiming to avoid the information loss that alignment-based strategies may cause.
Findings
MCTI outperforms existing methods on three benchmarks.
Effective tumor region mining via subtype classification.
Adaptive gene grouping improves genomic feature extraction.
Abstract
Survival prediction, utilizing pathological images and genomic profiles, is increasingly important in cancer analysis and prognosis. Despite significant progress, precise survival analysis still faces two main challenges: (1) The enormous number of pixels in whole slide images (WSIs) complicates the processing of pathological images, making it difficult to generate an effective representation of the tumor microenvironment (TME). (2) Existing multimodal methods often rely on alignment strategies to integrate complementary information, which may lead to information loss due to the inherent heterogeneity between pathology and genes. In this paper, we propose a Multimodal Cross-Task Interaction (MCTI) framework to explore the intrinsic correlations between subtype classification and survival analysis tasks. Specifically, to capture TME-related features in WSIs, we leverage the subtype…
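The TL;DR mentions optimal transport-based attention between pathology and genomic features, but this page does not give the formulation. As a rough illustration of the general idea only, the sketch below computes an entropic-regularized transport plan with Sinkhorn iterations and uses it as attention weights to pool patch features into each gene-group token. Everything here is an assumption made for illustration and not the authors' implementation: the function names `sinkhorn_plan` and `ot_attention`, the cosine-distance cost, the uniform marginals, the value of `eps`, and the random stand-in features.

```python
import numpy as np

def sinkhorn_plan(cost, eps=0.1, n_iters=200):
    """Entropic-regularized OT plan between two uniform marginals (assumed)."""
    n, m = cost.shape
    a = np.full(n, 1.0 / n)      # uniform mass over patch tokens
    b = np.full(m, 1.0 / m)      # uniform mass over gene-group tokens
    K = np.exp(-cost / eps)      # Gibbs kernel
    u = np.ones(n)
    v = np.ones(m)
    for _ in range(n_iters):
        u = a / (K @ v)          # rescale rows toward marginal a
        v = b / (K.T @ u)        # rescale columns to match marginal b
    return u[:, None] * K * v[None, :]

def ot_attention(patch_feats, gene_feats, eps=0.1):
    """Pool patch features into gene-group tokens using the OT plan as weights."""
    p = patch_feats / np.linalg.norm(patch_feats, axis=1, keepdims=True)
    g = gene_feats / np.linalg.norm(gene_feats, axis=1, keepdims=True)
    cost = 1.0 - p @ g.T         # cosine distance, values in [0, 2]
    plan = sinkhorn_plan(cost, eps=eps)
    # Normalize each column so every gene token gets a convex combination of patches.
    weights = plan / plan.sum(axis=0, keepdims=True)
    return weights.T @ patch_feats, plan

# Hypothetical stand-in features: 50 WSI patches and 6 gene groups, 16 dimensions each.
rng = np.random.default_rng(0)
patches = rng.normal(size=(50, 16))
genes = rng.normal(size=(6, 16))
fused, plan = ot_attention(patches, genes)
print(fused.shape, plan.shape)
```

Unlike softmax attention, which normalizes each query independently, a transport plan constrains both marginals at once, so no single patch can absorb all the attention mass. Whether MCTI uses this particular cost or regularization is not stated on this page.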
Taxonomy
Topics: AI in cancer detection · Brain Tumor Detection and Classification · Radiomics and Machine Learning in Medical Imaging
Methods: Softmax · Attention Is All You Need · Linear Layer · Multi-Head Attention
