A versatile informative diffusion model for single-cell ATAC-seq data generation and analysis
Lei Huang, Lei Xiong, Na Sun, Zunpeng Liu, Ka-Chun Wong, and Manolis, Kellis

TL;DR
ATAC-Diff is a novel versatile diffusion model designed for high-quality generation and analysis of single-cell ATAC-seq data, addressing noise and sparsity issues with a unified framework.
Contribution
It introduces the first diffusion-based model for scATAC-seq data, integrating auxiliary modules and mutual information regularization for multi-task versatility.
Findings
Outperforms state-of-the-art models in data generation quality.
Effectively handles multiple analysis tasks with a single framework.
Reduces noise and sparsity in scATAC-seq data.
Abstract
The rapid advancement of single-cell ATAC sequencing (scATAC-seq) technologies holds great promise for investigating the heterogeneity of epigenetic landscapes at the cellular level. The amplification process in scATAC-seq experiments often introduces noise due to dropout events, which results in extreme sparsity that hinders accurate analysis. Consequently, there is a significant demand for the generation of high-quality scATAC-seq data in silico. Furthermore, current methodologies are typically task-specific, lacking a versatile framework capable of handling multiple tasks within a single model. In this work, we propose ATAC-Diff, a versatile framework, which is based on a latent diffusion model conditioned on the latent auxiliary variables to adapt for various tasks. ATAC-Diff is the first diffusion model for the scATAC-seq data generation and analysis, composed of auxiliary modules…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSingle-cell and spatial transcriptomics
MethodsDropout · Diffusion · Latent Diffusion Model
