Learning interpretable representations of single-cell multi-omics data with multi-output Gaussian processes
Zahra Moslehi, Sareh AmeriFar, Kevin de Azevedo, Florian Buettner

TL;DR
This paper introduces a new method for analyzing single-cell data that balances powerful representation learning with interpretability.
Contribution
A novel framework combining expressive embeddings with interpretable Gaussian processes for multi-omics data.
Findings
The model learns distinct representations for cells and genes from multi-modal data.
Interpretable latent dimensions effectively capture the data's underlying structure.
Gene relevance maps connect cell clusters with their marker genes in latent space.
Abstract
Learning representations of single-cell genomics data is challenging due to the nonlinear and often multi-modal nature of the data on one hand and the need for interpretable representations on the other hand. Existing approaches tend to focus either on interpretability aspects via linear matrix factorization or on maximizing expressive power via neural network-based embeddings using black-box variational autoencoders or graph embedding approaches. We address this trade-off between expressive power and interpretability by introducing a novel approach that combines highly expressive representation learning via an embedding layer with interpretable multi-output Gaussian processes within a unified framework. In our model, we learn distinct representations for samples (cells) and features (genes) from multi-modal single-cell data. We demonstrate that even a few interpretable latent…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 10
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 11Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Single-cell and spatial transcriptomics · Gene Regulatory Network Analysis
