Prototype-driven fusion of pathology and spatial transcriptomics for interpretable survival prediction
Lihe Liu, Xiaoxi Pan, Yinyin Yuan, Lulu Shang

TL;DR
PathoSpatial is an interpretable multimodal framework that combines pathology images and spatial transcriptomics to improve survival prediction and provide biological insights.
Contribution
It introduces a novel prototype-driven fusion method for integrating WSIs and ST, enhancing interpretability and prognostic accuracy.
Findings
Achieves superior or comparable survival prediction performance.
Enables post-hoc interpretability and molecular risk decomposition.
Demonstrates effectiveness on a triple-negative breast cancer cohort.
Abstract
Whole slide images (WSIs) enable weakly supervised prognostic modeling via multiple instance learning (MIL). Spatial transcriptomics (ST) preserves in situ gene expression, providing a spatial molecular context that complements morphology. As paired WSI-ST cohorts scale to population level, leveraging their complementary spatial signals for prognosis becomes crucial; however, principled cross-modal fusion strategies remain limited for this paradigm. To this end, we introduce PathoSpatial, an interpretable end-to-end framework integrating co-registered WSIs and ST to learn spatially informed prognostic representations. PathoSpatial uses task-guided prototype learning within a multi-level experts architecture, adaptively orchestrating unsupervised within-modality discovery with supervised cross-modal aggregation. By design, PathoSpatial substantially strengthens interpretability while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Single-cell and spatial transcriptomics · Cell Image Analysis Techniques
