A Large-Scale Comparative Analysis of Imputation Methods for Single-Cell RNA Sequencing Data
Yuichiro Iwashita, Ahtisham Fazeel Abbasi, Koichi Kise, Andreas Dengel, Muhammad Nabeel Asim

TL;DR
This study comprehensively benchmarks 15 imputation methods for single-cell RNA sequencing data, revealing traditional methods often outperform deep learning approaches across various datasets and analyses.
Contribution
It provides a large-scale comparison of diverse imputation techniques, highlighting the importance of task-specific evaluation in scRNA-seq data analysis.
Findings
Traditional methods generally outperform deep learning methods.
Strong numerical recovery does not always improve downstream biological analyses.
Method performance varies across datasets and analysis types.
Abstract
Background: Single-cell RNA sequencing (scRNA-seq) enables gene expression profiling at cellular resolution but is inherently affected by sparsity caused by dropout events, where expressed genes are recorded as zeros due to technical limitations. These artifacts distort gene expression distributions and compromise downstream analyses. Numerous imputation methods have been proposed to recover latent transcriptional signals. These methods range from traditional statistical models to deep learning (DL)-based methods. However, their comparative performance remains unclear, as existing benchmarks evaluate only a limited subset of methods, datasets, and downstream analyses. Results: We present a comprehensive benchmark of 15 scRNA-seq imputation methods spanning 7 methodological categories, including traditional and DL-based methods. Methods are evaluated across 30 datasets from 10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
