Spatial Transcriptomics as Images for Large-Scale Pretraining

Yishun Zhu; Jiaxin Qi; Jian Wang; Yuhua Zheng; Jianqiang Huang

arXiv:2603.13432·cs.CV·March 24, 2026

Spatial Transcriptomics as Images for Large-Scale Pretraining

Yishun Zhu, Jiaxin Qi, Jian Wang, Yuhua Zheng, Jianqiang Huang

PDF

Open Access

TL;DR

This paper introduces a novel approach to pretraining spatial transcriptomics data by converting it into image-like patches, which preserves spatial context and enhances downstream analysis performance.

Contribution

The authors propose treating spatial transcriptomics as croppable images with controlled channels, enabling large-scale pretraining and outperforming traditional methods.

Findings

01

Image-like ST representation improves downstream tasks.

02

Cropping patches increases training samples.

03

Channel design enhances pretraining stability.

Abstract

Spatial Transcriptomics (ST) profiles thousands of gene expression values at discrete spots with precise coordinates on tissue sections, preserving spatial context essential for clinical and pathological studies. With rising sequencing throughput and advancing platforms, the expanding data volumes motivate large-scale ST pretraining. However, the fundamental unit for pretraining, i.e., what constitutes a single training sample, remains ill-posed. Existing choices fall into two camps: (1) treating each spot as an independent sample, which discards spatial dependencies and collapses ST into single-cell transcriptomics; and (2) treating an entire slide as a single sample, which produces prohibitively large inputs and drastically fewer training examples, undermining effective pretraining. To address this gap, we propose treating spatial transcriptomics as croppable images. Specifically, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics · Gene expression and cancer classification · Cell Image Analysis Techniques