# Spatial information matters: are traditional imputation methods effective for spatial transcriptomics data?

**Authors:** Fahim Hafiz, Riasat Azim, Swakkhar Shatabda

PMC · DOI: 10.1093/bib/bbag027 · Briefings in Bioinformatics · 2026-02-02

## TL;DR

This paper evaluates imputation methods for spatial transcriptomics data and introduces a new method that uses spatial information to better handle missing data.

## Contribution

A novel imputation method called SpaMean-Impute that leverages spatial information for improved dropout handling in SRT data.

## Key findings

- No single imputation method consistently outperforms others across all SRT datasets.
- SpaMean-Impute outperforms existing methods by 16.15% in ARI and 18.45% in NMI on average.
- The proposed method is computationally efficient, being 33× faster and using 1500 MB less memory than deep-learning-based approaches.

## Abstract

Recent advancements in spatially resolved transcriptomics (SRT) have enabled near single-cell resolution, providing rich spatial context crucial for uncovering biological insights. However, high-resolution SRT datasets remain sparse and prone to dropout events that may impede accurate interpretation. Computational imputation methods are often employed to recover missing values, yet existing state-of-the-art (SOTA) techniques—designed for tabular, single-cell RNA, or general SRT data—have not been systematically benchmarked on datasets produced by newer SRT technologies. In this study, we evaluate seven SOTA imputation methods across five emerging SRT platforms encompassing 23 datasets. Our results reveal that no single method consistently excels, with most struggling to accurately identify valid dropouts. Motivated by these limitations, we introduce `SpaMean-Impute', a novel imputation method tailored for SRT datasets that incorporates spatial information to mitigate dropout effects and detect valid dropouts. Our proposed method outperforms the SOTA imputation methods across evaluation metrics, such as adjusted rand index (ARI), normalized mutual information (NMI), adjusted mutual information (AMI), and homogeneity (HOMO). In case of ARI, the proposed method outperforms the SOTA methods on average 16.15%, whereas 18.45% improvement in NMI, 18.96% in AMI, and 13.98% in the case of HOMO. Furthermore, the proposed method is computationally efficient compared with other SOTA methods. For example, compared with the SOTA deep-learning-based imputation methods, the proposed method is \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
$\sim 33\times $\end{document} faster and requires, on average, 1500 MB less memory during imputation. Moreover, our approach offers notable computational efficiency. Source code, datasets, and benchmarking scripts are available at: https://github.com/FahimHafiz/SpaMean-Impute.

## Full-text entities

- **Genes:** Tbl1xr1 (Tbl1x/y related 1) [NCBI Gene 81004] {aka 8030499H02Rik, A630076E03Rik, C21, C230089I12Rik, DC42, Ira1}, Slc6a3 (solute carrier family 6 (neurotransmitter transporter, dopamine), member 3) [NCBI Gene 13162] {aka DAT, Dat1}, Cx3cl1 (C-X3-C motif chemokine ligand 1) [NCBI Gene 20312] {aka ABCD-3, CX3C, Cxc3, D8Bwg0439e, FK, Scyd1}, Cyp26b1 (cytochrome P450, family 26, subfamily b, polypeptide 1) [NCBI Gene 232174] {aka CP26, P450RAI-2}, Cdc42 (cell division cycle 42) [NCBI Gene 12540]
- **Diseases:** ASD (MESH:D000067877), diabetes (MESH:D003920), cancer (MESH:D009369), AD (MESH:D000544), schizophrenia (MESH:D012559), breast cancer (MESH:D001943), AMI (MESH:D000275), BRCA (MESH:D001941), WT (MESH:D009396)
- **Chemicals:** NaN (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12862982/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12862982/full.md

## References

58 references — full list in the complete paper: https://tomesphere.com/paper/PMC12862982/full.md

---
Source: https://tomesphere.com/paper/PMC12862982