A Large-Scale Comparative Analysis of Imputation Methods for Single-Cell RNA Sequencing Data

Yuichiro Iwashita; Ahtisham Fazeel Abbasi; Koichi Kise; Andreas Dengel; Muhammad Nabeel Asim

arXiv:2603.24626·q-bio.GN·April 15, 2026

A Large-Scale Comparative Analysis of Imputation Methods for Single-Cell RNA Sequencing Data

Yuichiro Iwashita, Ahtisham Fazeel Abbasi, Koichi Kise, Andreas Dengel, Muhammad Nabeel Asim

PDF

TL;DR

This study comprehensively benchmarks 15 imputation methods for single-cell RNA sequencing data, revealing traditional methods often outperform deep learning approaches across various datasets and analyses.

Contribution

It provides a large-scale comparison of diverse imputation techniques, highlighting the importance of task-specific evaluation in scRNA-seq data analysis.

Findings

01

Traditional methods generally outperform deep learning methods.

02

Strong numerical recovery does not always improve downstream biological analyses.

03

Method performance varies across datasets and analysis types.

Abstract

Background: Single-cell RNA sequencing (scRNA-seq) enables gene expression profiling at cellular resolution but is inherently affected by sparsity caused by dropout events, where expressed genes are recorded as zeros due to technical limitations. These artifacts distort gene expression distributions and compromise downstream analyses. Numerous imputation methods have been proposed to recover latent transcriptional signals. These methods range from traditional statistical models to deep learning (DL)-based methods. However, their comparative performance remains unclear, as existing benchmarks evaluate only a limited subset of methods, datasets, and downstream analyses. Results: We present a comprehensive benchmark of 15 scRNA-seq imputation methods spanning 7 methodological categories, including traditional and DL-based methods. Methods are evaluated across 30 datasets from 10…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.