A New Framework for Explainable Rare Cell Identification in Single-Cell Transcriptomics Data
Di Su, Kai Ming Ting, Jie Zhang, Xiaorui Zhang, Xinpeng Li

TL;DR
This paper introduces a novel framework for identifying and explaining rare cell types in single-cell transcriptomics data, emphasizing interpretability by avoiding PCA and using advanced anomaly detection and explanation methods.
Contribution
It presents a new explainable anomaly detection framework that directly identifies rare cells and provides gene-based explanations without relying on PCA.
Findings
Eliminates the PCA step for better interpretability
Employs state-of-the-art anomaly detection and explanation techniques
Provides visual, gene-based explanations for detected rare cells
Abstract
The detection of rare cell types in single-cell transcriptomics data is crucial for elucidating disease pathogenesis and tissue development dynamics. However, a critical gap that persists in current methods is their inability to provide an explanation based on genes for each cell they have detected as rare. We identify three primary sources of this deficiency. First, the anomaly detectors often function as "black boxes", designed to detect anomalies but unable to explain why a cell is anomalous. Second, the standard analytical framework hinders interpretability by relying on dimensionality reduction techniques, such as Principal Component Analysis (PCA), which transform meaningful gene expression data into abstract, uninterpretable features. Finally, existing explanation algorithms cannot be readily applied to this domain, as single-cell data is characterized by high dimensionality,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Gene expression and cancer classification · Bioinformatics and Genomic Networks
