SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics
Suyuan Zhao, Yizhen Luo, Ganbo Yang, Yan Zhong, Hao Zhou, Zaiqing Nie

TL;DR
SToFM is a novel multi-scale foundation model for spatial transcriptomics that integrates macro, micro, and gene-scale data to improve tissue analysis and cell characterization.
Contribution
The paper introduces SToFM, a multi-scale spatial transcriptomics foundation model utilizing SE(2) Transformer and a large pretraining corpus, advancing ST data analysis.
Findings
Enhanced tissue region segmentation accuracy
Improved cell type annotation performance
Effective multi-scale information integration
Abstract
Spatial Transcriptomics (ST) technologies provide biologists with rich insights into single-cell biology by preserving spatial context of cells. Building foundational models for ST can significantly enhance the analysis of vast and complex data sources, unlocking new perspectives on the intricacies of biological tissues. However, modeling ST data is inherently challenging due to the need to extract multi-scale information from tissue slices containing vast numbers of cells. This process requires integrating macro-scale tissue morphology, micro-scale cellular microenvironment, and gene-scale gene expression profile. To address this challenge, we propose SToFM, a multi-scale Spatial Transcriptomics Foundation Model. SToFM first performs multi-scale information extraction on each ST slice, to construct a set of ST sub-slices that aggregate macro-, micro- and gene-scale information. Then an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Cancer-related molecular mechanisms research · Single-cell and spatial transcriptomics
