scCross: efficient search for rare subpopulations across multiple single-cell samples
Alexander Gerniers, Siegfried Nijssen, Pierre Dupont

TL;DR
scCross is a new method for finding rare cell types across multiple single-cell datasets without needing to merge the data first, making it more accurate despite batch effects.
Contribution
scCross introduces a biclustering approach to identify rare subpopulations across samples without prior data integration.
Findings
scCross identified a cilium subpopulation with potential new ciliary genes in lung cancer cells.
It successfully detected rare subpopulations in human pancreas samples with different sequencing protocols.
scCross outperformed alternatives in identifying rare cell types with artificial batch effects.
Abstract
Identifying rare cell types is an important task to capture the heterogeneity of single-cell data, such as scRNA-seq. The widespread availability of such data enables to aggregate multiple samples, corresponding for example to different donors, into the same study. Yet, such aggregated data is often subject to batch effects between samples. Clustering it therefore generally requires the use of data integration methods, which can lead to overcorrection, making the identification of rare cells difficult. We present scCross, a biclustering method identifying rare subpopulations of cells present across multiple single-cell samples. It jointly identifies a group of cells with specific marker genes by relying on a global sum criterion, computed over entire subpopulation of cells, rather than pairwise comparisons between individual cells. This proves robust with respect to the high variability…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Wastewater Treatment and Nitrogen Removal
