Scalable Amortized GPLVMs for Single Cell Transcriptomics Data

Sarah Zhao; Aditya Ravuri; Vidhi Lalchand; Neil D. Lawrence

arXiv:2405.03879·stat.ML·May 8, 2024

Scalable Amortized GPLVMs for Single Cell Transcriptomics Data

Sarah Zhao, Aditya Ravuri, Vidhi Lalchand, Neil D. Lawrence

PDF

Open Access

TL;DR

This paper presents a scalable amortized Bayesian GPLVM tailored for single-cell RNA-seq data, improving interpretability and clustering over existing models by incorporating specialized design features.

Contribution

The authors introduce an improved amortized stochastic variational Bayesian GPLVM with specialized components, enhancing scalability and interpretability for single-cell transcriptomics.

Findings

01

Matches performance of scVI on synthetic and real datasets

02

Effectively incorporates cell-cycle and batch effects

03

Reveals interpretable latent structures in immunity data

Abstract

Dimensionality reduction is crucial for analyzing large-scale single-cell RNA-seq data. Gaussian Process Latent Variable Models (GPLVMs) offer an interpretable dimensionality reduction method, but current scalable models lack effectiveness in clustering cell types. We introduce an improved model, the amortized stochastic variational Bayesian GPLVM (BGPLVM), tailored for single-cell RNA-seq with specialized encoder, kernel, and likelihood designs. This model matches the performance of the leading single-cell variational inference (scVI) approach on synthetic and real-world COVID datasets and effectively incorporates cell-cycle and batch information to reveal more interpretable latent structures as we demonstrate on an innate immunity dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics

MethodsGaussian Process · Variational Inference