Sparse Linear Identifiable Multivariate Modeling

Ricardo Henao; Ole Winther

arXiv:1004.5265·stat.ML·June 24, 2011·J. Mach. Learn. Res.·28 cites

Sparse Linear Identifiable Multivariate Modeling

Ricardo Henao, Ole Winther

PDF

Open Access

TL;DR

This paper introduces SLIM, a Bayesian framework for sparse, identifiable multivariate models that efficiently infer latent structures and outperform existing methods like LiNGAM in biological data analysis.

Contribution

It proposes a novel Bayesian hierarchical approach with stochastic search for sparse, identifiable linear models, extending to non-linear and correlated latent variable models.

Findings

01

SLIM performs as well or better than LiNGAM in experiments.

02

The method is computationally efficient and scalable.

03

Extensions include non-linear and correlated latent variable models.

Abstract

In this paper we consider sparse and identifiable linear latent variable (factor) and linear Bayesian network models for parsimonious analysis of multivariate data. We propose a computationally efficient method for joint parameter and model inference, and model comparison. It consists of a fully Bayesian hierarchy for sparse models using slab and spike priors (two-component delta-function and continuous mixtures), non-Gaussian latent factors and a stochastic search over the ordering of the variables. The framework, which we call SLIM (Sparse Linear Identifiable Multivariate modeling), is validated and bench-marked on artificial and real biological data sets. SLIM is closest in spirit to LiNGAM (Shimizu et al., 2006), but differs substantially in inference, Bayesian network structure learning and model comparison. Experimentally, SLIM performs equally well or better than LiNGAM with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Gene expression and cancer classification · Statistical Methods and Inference