eDOC: Explainable Decoding Out-of-domain Cell Types with Evidential Learning
Chaochen Wu, Meiyun Zuo, Lei Xie

TL;DR
eDOC is a novel transformer-based method that accurately identifies in-domain and out-of-domain cell types in single-cell RNA-seq data, quantifies uncertainty, and highlights gene drivers, advancing cell annotation and biological insight.
Contribution
This paper introduces eDOC, a transformer and evidential learning-based approach for improved out-of-domain cell type detection and gene driver identification in scRNA-seq analysis.
Findings
eDOC outperforms existing methods in OOD cell type detection
eDOC effectively identifies gene drivers for both IND and OOD cells
eDOC enhances understanding of gene regulatory mechanisms
Abstract
Single-cell RNA-seq (scRNA-seq) technology is a powerful tool for unraveling the complexity of biological systems. One of essential and fundamental tasks in scRNA-seq data analysis is Cell Type Annotation (CTA). In spite of tremendous efforts in developing machine learning methods for this problem, several challenges remains. They include identifying Out-of-Domain (OOD) cell types, quantifying the uncertainty of unseen cell type annotations, and determining interpretable cell type-specific gene drivers for an OOD case. OOD cell types are often associated with therapeutic responses and disease origins, making them critical for precision medicine and early disease diagnosis. Additionally, scRNA-seq data contains tens thousands of gene expressions. Pinpointing gene drivers underlying CTA can provide deep insight into gene regulatory mechanisms and serve as disease biomarkers. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Digital Media Forensic Detection
