TL;DR
This paper introduces a scalable variational inference method for joint analysis of high-dimensional genetic and molecular data, enabling efficient and accurate identification of associations in large-scale studies.
Contribution
It develops a variational inference approach for sparse multivariate regression, significantly reducing computation time while maintaining accuracy compared to MCMC methods.
Findings
Outperforms existing variable selection methods.
Handles hundreds of thousands of genetic variants efficiently.
Provides near-MCMC accuracy at a fraction of the computational cost.
Abstract
Combined inference for heterogeneous high-dimensional data is critical in modern biology, where clinical and various kinds of molecular data may be available from a single study. Classical genetic association studies regress a single clinical outcome on many genetic variants one by one, but there is an increasing demand for joint analysis of many molecular outcomes and genetic variants in order to unravel functional interactions. Unfortunately, most existing approaches to joint modelling are either too simplistic to be powerful or are impracticable for computational reasons. Inspired by Richardson et al. (2010, Bayesian Statistics 9), we consider a sparse multivariate regression model that allows simultaneous selection of predictors and associated responses. As Markov chain Monte Carlo (MCMC) inference on such models can be prohibitively slow when the number of genetic variants exceeds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
