Factor pre-training in Bayesian multivariate logistic models
Lorenzo Mauri, David B. Dunson

TL;DR
This paper introduces a scalable Bayesian factor pre-estimation method for high-dimensional multivariate logistic regression, improving inference efficiency and accuracy in ecological data analysis.
Contribution
It proposes a novel approach that leverages dimensionality to pre-estimate latent factors, enabling parallel Gaussian approximations for scalable Bayesian inference.
Findings
Posterior concentration properties are established.
Method demonstrates excellent empirical performance in simulations.
Applied successfully to insect biodiversity data in Madagascar.
Abstract
This article focuses on inference in logistic regression for high-dimensional binary outcomes. A popular approach induces dependence across the outcomes by including latent factors in the linear predictor. Bayesian approaches are useful for characterizing uncertainty in inferring the regression coefficients, factors and loadings, while also incorporating hierarchical and shrinkage structure. However, Markov chain Monte Carlo algorithms for posterior computation face challenges in scaling to high-dimensional outcomes. Motivated by applications in ecology, we exploit a blessing of dimensionality to motivate pre-estimation of the latent factors. Conditionally on the factors, the outcomes are modeled via independent logistic regressions. We implement Gaussian approximations in parallel in inferring the posterior on the regression coefficients and loadings, including a simple adjustment to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Multi-Criteria Decision Making · Statistical and Computational Modeling
