scMamba: A Pre-Trained Model for Single-Nucleus RNA Sequencing Analysis in Neurodegenerative Disorders
Gyutaek Oh, Baekgyu Choi, Seyoung Jin, Inkyung Jung, Jong Chul Ye

TL;DR
scMamba is a pre-trained model that enhances single-nucleus RNA sequencing analysis for neurodegenerative diseases by improving data quality and analysis accuracy through a novel architecture and generalizable feature learning.
Contribution
The paper introduces scMamba, a novel pre-trained model with a unique architecture that effectively processes snRNA-seq data without dimension reduction, improving multiple downstream analysis tasks.
Findings
Outperforms benchmark methods in cell type annotation
Improves doublet detection accuracy
Enhances identification of differentially expressed genes
Abstract
Single-nucleus RNA sequencing (snRNA-seq) has significantly advanced our understanding of the disease etiology of neurodegenerative disorders. However, the low quality of specimens derived from postmortem brain tissues, combined with the high variability caused by disease heterogeneity, makes it challenging to integrate snRNA-seq data from multiple sources for precise analyses. To address these challenges, we present scMamba, a pre-trained model designed to improve the quality and utility of snRNA-seq analysis, with a particular focus on neurodegenerative diseases. Inspired by the recent Mamba model, scMamba introduces a novel architecture that incorporates a linear adapter layer, gene embeddings, and bidirectional Mamba blocks, enabling efficient processing of snRNA-seq data while preserving information from the raw input. Notably, scMamba learns generalizable features of cells and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · RNA regulation and disease · Genomics and Phylogenetic Studies
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · Focus · Adapter
