Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining

Yucheng Xing; Pei Liu; Jingying Ma; Ruping Hong; Jiangdong Qiu; Tianyu Liu; Kai He; Ling Huang; Mengling Feng

arXiv:2605.16392·q-bio.QM·May 19, 2026

Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining

Yucheng Xing, Pei Liu, Jingying Ma, Ruping Hong, Jiangdong Qiu, Tianyu Liu, Kai He, Ling Huang, Mengling Feng

PDF

TL;DR

The paper introduces MIST, a virtual molecular staining method that enhances pathology MIL models by incorporating molecular information during training without requiring transcriptomics at inference.

Contribution

MIST replaces the standard projection layer in MIL with a molecularly informed module trained on spatial transcriptomics, improving performance across multiple tasks.

Findings

01

MIST improves 240 of 256 configurations over standard projection layers.

02

Average gain of +3.5% across tasks, with +5.2% on survival prediction.

03

Gene-derived prototypes are key to the observed performance improvements.

Abstract

Multiple instance learning (MIL) is the dominant framework for whole-slide image analysis in computational pathology, typically combining a frozen patch encoder, a projection layer, and a slide-level aggregator. While encoders and aggregators have been extensively studied, the projection layer remains a largely morphology-only bottleneck. This limits endpoints such as biomarker status and survival, which are governed by a molecular state that is not fully captured by H&E morphology. We introduce Molecularly Informed Staining Transform (MIST), a plug-in replacement for the MIL projection layer that uses paired spatial transcriptomics only during training to construct virtual molecular stains. MIST clusters gene expression profiles into cross-modal prototypes, anchors them in the frozen foundation model feature space, and uses them to reorganize H&E patch features along molecularly guided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.