TL;DR
This paper introduces a novel variational EM-based Bayesian method for simultaneous network estimation and variable selection in compositional count data, demonstrated through simulations and microbiome data analysis.
Contribution
It develops a hierarchical Bayesian model with spike-and-slab priors and a new variational inference scheme for efficient joint network and covariate inference in compositional data.
Findings
Outperforms existing methods in network recovery accuracy
Effective in microbiome data analysis for understanding microbe interactions
Provides an open-source Python implementation called SINC
Abstract
Network estimation and variable selection have been extensively studied in the statistical literature, but only recently have those two challenges been addressed simultaneously. In this paper, we seek to develop a novel method to simultaneously estimate network interactions and associations to relevant covariates for count data, and specifically for compositional data, which have a fixed sum constraint. We use a hierarchical Bayesian model with latent layers and employ spike-and-slab priors for both edge and covariate selection. For posterior inference, we develop a novel variational inference scheme with an expectation maximization step, to enable efficient estimation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in its accuracy of network recovery. We show the practical utility of our model via an application to microbiome data. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
