Bayesian model selection in the $\mathcal{M}$-open setting -- Approximate posterior inference and probability-proportional-to-size subsampling for efficient large-scale leave-one-out cross-validation
Riko Kelter

TL;DR
This paper introduces an efficient Bayesian model comparison method using approximate PSIS-LOO and subsampling, suitable for large-scale, high-dimensional data in psychological research.
Contribution
It provides a tutorial on PSIS-LOO, compares it with other methods, and demonstrates its application and efficiency in large-scale Bayesian model selection.
Findings
PSIS-LOO outperforms traditional LOO-CV in computational efficiency.
Posterior approximations combined with subsampling enable large-scale model comparison.
The method is practical for high-dimensional, big-data Bayesian model evaluation.
Abstract
Comparison of competing statistical models is an essential part of psychological research. From a Bayesian perspective, various approaches to model comparison and selection have been proposed in the literature. However, the applicability of these approaches strongly depends on the assumptions about the model space , the so-called model view. Furthermore, traditional methods like leave-one-out cross-validation (LOO-CV) estimate the expected log predictive density (ELPD) of a model to investigate how the model generalises out-of-sample, which quickly becomes computationally inefficient when sample size becomes large. Here, we provide a tutorial on approximate Pareto-smoothed importance sampling leave-one-out cross-validation (PSIS-LOO), a computationally efficient method for Bayesian model comparison. First, we discuss several model views and the available Bayesian model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Statistical Methods in Clinical Trials
