M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images
Hongyi Wang, Xiuju Du, Jing Liu, Shuyi Ouyang, Yen-Wei Chen, Lanfen, Lin

TL;DR
M2OST introduces a multi-scale, many-to-one Transformer model that leverages hierarchical pathology image data to accurately predict spatial transcriptomics, outperforming existing methods with fewer resources.
Contribution
The paper presents M2OST, a novel many-to-one regression Transformer that effectively utilizes multi-scale image features and inter-spot information for spatial transcriptomics prediction.
Findings
Achieves state-of-the-art performance on three public datasets.
Uses fewer parameters and FLOPs than existing methods.
Effectively incorporates hierarchical and inter-spot features.
Abstract
The advancement of Spatial Transcriptomics (ST) has facilitated the spatially-aware profiling of gene expressions based on histopathology images. Although ST data offers valuable insights into the micro-environment of tumors, its acquisition cost remains expensive. Therefore, directly predicting the ST expressions from digital pathology images is desired. Current methods usually adopt existing regression backbones along with patch-sampling for this task, which ignores the inherent multi-scale information embedded in the pyramidal data structure of digital pathology images, and wastes the inter-spot visual information crucial for accurate gene expression prediction. To address these limitations, we propose M2OST, a many-to-one regression Transformer that can accommodate the hierarchical structure of the pathology images via a decoupled multi-scale feature extractor. Unlike traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI in cancer detection · Single-cell and spatial transcriptomics · Gene expression and cancer classification
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Dense Connections
