Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining
Yucheng Xing, Pei Liu, Jingying Ma, Ruping Hong, Jiangdong Qiu, Tianyu Liu, Kai He, Ling Huang, Mengling Feng

TL;DR
The paper introduces MIST, a virtual molecular staining method that enhances pathology MIL models by incorporating molecular information during training without requiring transcriptomics at inference.
Contribution
MIST replaces the standard projection layer in MIL with a molecularly informed module trained on spatial transcriptomics, improving performance across multiple tasks.
Findings
MIST improves 240 of 256 configurations over standard projection layers.
Average gain of +3.5% across tasks, with +5.2% on survival prediction.
Gene-derived prototypes are key to the observed performance improvements.
Abstract
Multiple instance learning (MIL) is the dominant framework for whole-slide image analysis in computational pathology, typically combining a frozen patch encoder, a projection layer, and a slide-level aggregator. While encoders and aggregators have been extensively studied, the projection layer remains a largely morphology-only bottleneck. This limits endpoints such as biomarker status and survival, which are governed by a molecular state that is not fully captured by H&E morphology. We introduce Molecularly Informed Staining Transform (MIST), a plug-in replacement for the MIL projection layer that uses paired spatial transcriptomics only during training to construct virtual molecular stains. MIST clusters gene expression profiles into cross-modal prototypes, anchors them in the frozen foundation model feature space, and uses them to reorganize H&E patch features along molecularly guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
