MMIST-ccRCC: A Real World Medical Dataset for the Development of Multi-Modal Systems
Tiago Mota, M. Rita Verdelho, Alceu Bissoto, Carlos Santiago, and Catarina Barata

TL;DR
This paper introduces MMIST-ccRCC, a comprehensive multi-modal dataset for renal cancer, and benchmarks multi-modal fusion methods for survival prediction despite high missing data rates, demonstrating improved performance with modality generation strategies.
Contribution
It provides a real-world multi-modal dataset for ccRCC and evaluates fusion strategies under missing data conditions, advancing multi-modal machine learning in healthcare.
Findings
Fusion of modalities improves survival prediction accuracy.
Latent modality generation enhances performance with missing data.
High missing rates still allow effective multi-modal fusion.
Abstract
The acquisition of different data modalities can enhance our knowledge and understanding of various diseases, paving the way for a more personalized healthcare. Thus, medicine is progressively moving towards the generation of massive amounts of multi-modal data (\emph{e.g,} molecular, radiology, and histopathology). While this may seem like an ideal environment to capitalize data-centric machine learning approaches, most methods still focus on exploring a single or a pair of modalities due to a variety of reasons: i) lack of ready to use curated datasets; ii) difficulty in identifying the best multi-modal fusion strategy; and iii) missing modalities across patients. In this paper we introduce a real world multi-modal dataset called MMIST-CCRCC that comprises 2 radiology modalities (CT and MRI), histopathology, genomics, and clinical data from 618 patients with clear cell renal cell…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
MethodsFocus
