sc-OTGM: Single-Cell Perturbation Modeling by Solving Optimal Mass Transport on the Manifold of Gaussian Mixtures
Andac Demir, Elizaveta Solovyeva, James Boylan, Mei Xiao, Fabrizio Serluca, Sebastian Hoersch, Jeremy Jenkins, Murthy Devarakonda, Bulent Kiziltan

TL;DR
sc-OTGM is a compact, unsupervised single-cell modeling approach that leverages optimal mass transport on Gaussian mixture manifolds to classify cell states, analyze gene perturbations, and generate synthetic data, outperforming larger models.
Contribution
This paper introduces sc-OTGM, a novel, highly efficient model that applies optimal mass transport on Gaussian mixture models for single-cell data analysis, with significantly fewer parameters than existing foundation models.
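As a rough illustration of what optimal mass transport between Gaussian components involves, the sketch below computes the standard closed-form squared 2-Wasserstein distance between two Gaussians. It is not taken from the paper: the function names `psd_sqrt` and `gaussian_w2_squared` are hypothetical, and the paper's actual transport formulation over full mixtures may differ from this single-pair case.

```python
import numpy as np


def psd_sqrt(mat):
    """Symmetric square root of a positive semi-definite matrix."""
    vals, vecs = np.linalg.eigh(mat)
    # Clip tiny negative eigenvalues caused by floating-point error.
    vals = np.clip(vals, 0.0, None)
    return (vecs * np.sqrt(vals)) @ vecs.T


def gaussian_w2_squared(mean_a, cov_a, mean_b, cov_b):
    """Squared 2-Wasserstein distance between N(mean_a, cov_a) and N(mean_b, cov_b).

    Closed form: ||m_a - m_b||^2 + Tr(S_a + S_b - 2 (S_b^{1/2} S_a S_b^{1/2})^{1/2}).
    Illustrative only; not the authors' implementation.
    """
    root_b = psd_sqrt(cov_b)
    cross = psd_sqrt(root_b @ cov_a @ root_b)
    mean_term = np.sum((mean_a - mean_b) ** 2)
    cov_term = np.trace(cov_a + cov_b - 2.0 * cross)
    return float(mean_term + cov_term)


# Hypothetical example: two 2-D components standing in for
# a control and a perturbed cell state.
control_mean = np.array([0.0, 0.0])
perturbed_mean = np.array([3.0, 4.0])
identity_cov = np.eye(2)
shift_cost = gaussian_w2_squared(control_mean, identity_cov, perturbed_mean, identity_cov)
```

With equal covariances the cost reduces to the squared distance between the means, so a pure shift in mean expression is the only thing being penalized in this example.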
Findings
Effective in cell state classification on perturbation data
Assists in differential gene expression analysis
Predicts effects of gene perturbations and generates synthetic data
Abstract
Influenced by breakthroughs in LLMs, single-cell foundation models are emerging. While these models show successful performance in cell type clustering, phenotype classification, and gene perturbation response prediction, it remains to be seen if a simpler model could achieve comparable or better results, especially with limited data. This is important, as the quantity and quality of single-cell data typically fall short of the standards in textual data used for training LLMs. Single-cell sequencing often suffers from technical artifacts, dropout events, and batch effects. These challenges are compounded in a weakly supervised setting, where the labels of cell states can be noisy, further complicating the analysis. To tackle these challenges, we present sc-OTGM, streamlined with less than 500K parameters, making it approximately 100x more compact than the foundation models, offering an…
