Encoding Domain Information with Sparse Priors for Inferring Explainable   Latent Variables

Arber Qoku; Florian Buettner

arXiv:2107.03730·stat.ML·April 12, 2022

Encoding Domain Information with Sparse Priors for Inferring Explainable Latent Variables

Arber Qoku, Florian Buettner

PDF

Open Access 1 Repo

TL;DR

This paper introduces spex-LVM, a sparse prior-based latent variable model that enhances interpretability by integrating domain knowledge, effectively uncovering meaningful biological factors in high-dimensional data like single-cell RNA-seq.

Contribution

The paper presents spex-LVM, a novel factorial latent variable model that incorporates sparse priors and domain knowledge to produce explainable and interpretable latent factors.

Findings

01

Robustly identifies relevant biological structures.

02

Distinguishes technical noise from true variation.

03

Adapts pathway annotations to specific datasets.

Abstract

Latent variable models are powerful statistical tools that can uncover relevant variation between patients or cells, by inferring unobserved hidden states from observable high-dimensional data. A major shortcoming of current methods, however, is their inability to learn sparse and interpretable hidden states. Additionally, in settings where partial knowledge on the latent structure of the data is readily available, a statistically sound integration of prior information into current methods is challenging. To address these issues, we propose spex-LVM, a factorial latent variable model with sparse priors to encourage the inference of explainable factors driven by domain-relevant information. spex-LVM utilizes existing knowledge of curated biomedical pathways to automatically assign annotated attributes to latent factors, yielding interpretable results tailored to the corresponding domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MLO-lab/spexlvm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification