scE2TM improves single-cell embedding interpretability and reveals cellular perturbation signatures
Hegang Chen, Yuyin Lu, Yifan Zhao, Zhiming Dai, Fu Lee Wang, Qing Li, Yanghui Rao, Yue Li

TL;DR
scE2TM is a novel external knowledge-guided embedded topic model that enhances interpretability and biological insight in single-cell RNA-seq data analysis, outperforming existing methods in clustering and biological pathway consistency.
Contribution
The paper introduces scE2TM, a new model that prevents topic interpretation collapse and leverages external biological knowledge for improved single-cell embedding interpretability.
Findings
scE2TM achieves superior clustering performance across 20 datasets.
It produces more diverse and biologically consistent topics.
It accurately models cellular responses and identifies disease-specific gene programs.
Abstract
Single-cell RNA sequencing technologies have revolutionized our understanding of cellular heterogeneity, yet computational methods often struggle to balance performance with biological interpretability. Embedded topic models have been widely used for interpretable single-cell embedding learning. However, these models suffer from the potential problem of interpretation collapse, where topics semantically collapse towards each other, resulting in redundant topics and incomplete capture of biological variation. Furthermore, the rise of single-cell foundation models creates opportunities to harness external biological knowledge for guiding model embeddings. Here, we present scE2TM, an external knowledge-guided embedded topic model that provides a high-quality cell embedding and interpretation for scRNA-seq analysis. Through embedding clustering regularization method, each topic is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Cell Image Analysis Techniques · Immune responses and vaccinations
