# stPipe: a flexible and streamlined R/Bioconductor pipeline for preprocessing sequencing-based spatial transcriptomics data

**Authors:** Yang Xu, Callum J Sargeant, Yue You, Yupei You, Shian Su, Changqing Wang, Luyi Tian, Yunshun Chen, Matthew E Ritchie

PMC · DOI: 10.1093/nargab/lqaf167 · NAR Genomics and Bioinformatics · 2025-11-22

## TL;DR

stPipe is a new R/Bioconductor pipeline that simplifies preprocessing of spatial transcriptomics data from multiple platforms, enabling consistent and uniform analysis.

## Contribution

The novel contribution is a modular, open-source pipeline for preprocessing spatial transcriptomics data across multiple platforms in a standardized way.

## Key findings

- stPipe processes raw FASTQ files into spatially resolved gene count matrices.
- It standardizes data storage for compatibility with downstream analysis tools.
- The pipeline facilitates benchmarking of spatial transcriptomics methods using reference datasets.

## Abstract

Spatial transcriptomics technology has developed rapidly in recent years, with various sequencing-based platforms such as 10× Visium, Slide-seq, and Stereo-seq becoming widely used by researchers. Each platform brings its own set of protocols and customized data analysis pipelines, which presents challenges when the goal is to obtain uniformly preprocessed data that is conveniently formatted for downstream analysis. To address the need for simpler, open-source solutions that deal with sequencing-based spatial transcriptomics (sST) data from different platforms, we present stPipe, a comprehensive and modular preprocessing pipeline for all current mainstream sST platforms. stPipe is implemented as an R/Bioconductor package that handles various analysis steps, including (i) data processing from raw FASTQ files to create a spatially resolved gene count matrix; (ii) the collation of relevant quality control metrics to ensure unwanted artifacts can be filtered; and (iii) the adoption of standardized data storage containers to allow results to be easily passed on to a wide range of downstream analysis packages. A key use case for stPipe is in methods benchmarking, and we demonstrate how the uniform processing of sST data collected on reference tissue samples from the cadasSTre and SpatialBenchVisium projects is made easier, allowing comparisons between different technology platforms and downstream analysis tools.

## Full-text entities

- **Genes:** NR1I4 (nuclear receptor subfamily 1 group I member 4) [NCBI Gene 22817] {aka CAR2}, Ttr (transthyretin) [NCBI Gene 22139] {aka prealbumin}, Prdm8 (PR domain containing 8) [NCBI Gene 77630] {aka 4930565F05Rik, PFM5}, Slc17a7 (solute carrier family 17 (sodium-dependent inorganic phosphate cotransporter), member 7) [NCBI Gene 72961] {aka 2900052E22Rik, Vglut1}, Ntf3 (neurotrophin 3) [NCBI Gene 18205] {aka HDNF, NGF-2, Nt3, Ntf-3}, Hpca (hippocalcin) [NCBI Gene 15444], Amigo2 (adhesion molecule with Ig like domain 2) [NCBI Gene 105827] {aka AMIGO-2, Ali1}, Car2 (carbonic anhydrase 2) [NCBI Gene 12349] {aka CAII, Ca2, Car-2, Ltw-5, Lvtw-5}, Ptgds (prostaglandin D2 synthase (brain)) [NCBI Gene 19215] {aka 21kDa, L-PGDS, PGD2, PGDS, PGDS2, Ptgs3}, Car3 (carbonic anhydrase 3) [NCBI Gene 12350] {aka Ca3, Car-3}, Rgs14 (regulator of G-protein signaling 14) [NCBI Gene 51791] {aka RPIP1}, Prox1 (prospero homeobox 1) [NCBI Gene 19130] {aka A230003G05Rik, PROX-1}, Camk4 (calcium/calmodulin-dependent protein kinase IV) [NCBI Gene 12326] {aka A430110E23Rik, CaMKIV, CaMKIV/Gr, D18Bwg0362e}
- **Diseases:** ST (MESH:D008569), malaria infection (MESH:D008288)
- **Chemicals:** formalin (MESH:D005557), poly-A (MESH:D011061), Stereo (-), H&amp;E (MESH:D006371), paraffin (MESH:D010232), polyacrylamide (MESH:C016679)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12639247/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12639247/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12639247/full.md

---
Source: https://tomesphere.com/paper/PMC12639247