Revealing the Best Strategies for Rare Cell Type Detection in Multi-Sample Single-Cell Datasets
Zhiwei Ye, Yinqiao Yan, Yuanyuan Yu, Hao Wu

TL;DR
This study compares strategies for detecting rare cell types in multi-sample single-cell RNA sequencing data, finding that batch-corrected pooled analysis works best.
Contribution
The study systematically benchmarks rare cell detection methods in multi-sample settings and identifies optimal analytical strategies.
Findings
Batch-corrected pooled sample detection outperformed other strategies across methods and datasets.
scCAD showed the most robust and stable performance among evaluated tools.
Abstract
Background: Single-cell RNA sequencing (scRNA-seq) enables high-resolution characterization of cellular heterogeneity and provides unique opportunities to identify rare cell populations that may be obscured in bulk transcriptomic data. However, despite the growing interest in rare-cell discovery, most existing detection methods were originally developed for single-sample datasets, and their behavior in multi-sample settings—where batch effects, sample imbalance, and heterogeneous cell-type compositions are common—remains poorly understood. This study aims to systematically evaluate representative rare cell detection methods under multi-sample settings and identify the most effective analytical strategies. Methods: We performed a comprehensive benchmarking analysis of five widely used rare cell detection tools, CellSIUS, GapClust, GiniClust, scCAD, SCISSORS and a scGPT-based rare cell…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Cancer Genomics and Diagnostics · Cell Image Analysis Techniques
