A subsampling approach for Bayesian model selection
Jon Lachmann, Geir Storvik, Florian Frommlet, Aliaksadr Hubin

TL;DR
This paper introduces a subsampling-based Bayesian model selection method that uses stochastic gradient descent to efficiently estimate marginal likelihoods in large datasets, improving scalability.
Contribution
It proposes a novel subsampling approach combined with MCMC for scalable Bayesian model selection in generalized linear models.
Findings
The method achieves comparable accuracy to traditional approaches.
It significantly reduces computational time on large datasets.
Theoretical convergence guarantees are provided.
Abstract
It is common practice to use Laplace approximations to compute marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to compute the posterior marginal probabilities of models and individual covariates. This allows performing Bayesian model selection and model averaging. For large sample sizes, even the Laplace approximation becomes computationally challenging because the optimisation routine involved needs to evaluate the likelihood on the full set of data in multiple iterations. As a consequence, the algorithm is not scalable for large datasets. To address this problem, we suggest using a version of a popular batch stochastic gradient descent (BSGD) algorithm for estimating the marginal likelihood of a GLM by subsampling from the data. We further combine the algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Gaussian Processes and Bayesian Inference
