Multimodal Prototyping for cancer survival prediction
Andrew H. Song, Richard J. Chen, Guillaume Jaume, Anurag J. Vaidya,, Alexander S. Baras, Faisal Mahmood

TL;DR
This paper introduces a multimodal cancer survival prediction framework that condenses histology images and transcriptomic data into prototypes, enabling efficient, interpretable, and accurate prognostication across multiple cancer types.
Contribution
The study proposes a novel prototype-based summarization method for multimodal data, reducing computational complexity and enhancing interpretability in survival prediction models.
Findings
Outperforms state-of-the-art methods on six cancer types
Achieves over 300x data compression without loss of accuracy
Enables new interpretability analyses of multimodal data
Abstract
Multimodal survival methods combining gigapixel histology whole-slide images (WSIs) and transcriptomic profiles are particularly promising for patient prognostication and stratification. Current approaches involve tokenizing the WSIs into smaller patches (>10,000 patches) and transcriptomics into gene groups, which are then integrated using a Transformer for predicting outcomes. However, this process generates many tokens, which leads to high memory requirements for computing attention and complicates post-hoc interpretability analyses. Instead, we hypothesize that we can: (1) effectively summarize the morphological content of a WSI by condensing its constituting tokens using morphological prototypes, achieving more than 300x compression; and (2) accurately characterize cellular functions by encoding the transcriptomic profile with biological pathway prototypes, all in an unsupervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Adam · Dense Connections
