Bayesian model selection in the $\mathcal{M}$-open setting --   Approximate posterior inference and probability-proportional-to-size   subsampling for efficient large-scale leave-one-out cross-validation

Riko Kelter

arXiv:2005.13199·stat.AP·May 28, 2020

Bayesian model selection in the $\mathcal{M}$-open setting -- Approximate posterior inference and probability-proportional-to-size subsampling for efficient large-scale leave-one-out cross-validation

Riko Kelter

PDF

Open Access

TL;DR

This paper introduces an efficient Bayesian model comparison method using approximate PSIS-LOO and subsampling, suitable for large-scale, high-dimensional data in psychological research.

Contribution

It provides a tutorial on PSIS-LOO, compares it with other methods, and demonstrates its application and efficiency in large-scale Bayesian model selection.

Findings

01

PSIS-LOO outperforms traditional LOO-CV in computational efficiency.

02

Posterior approximations combined with subsampling enable large-scale model comparison.

03

The method is practical for high-dimensional, big-data Bayesian model evaluation.

Abstract

Comparison of competing statistical models is an essential part of psychological research. From a Bayesian perspective, various approaches to model comparison and selection have been proposed in the literature. However, the applicability of these approaches strongly depends on the assumptions about the model space $M$ , the so-called model view. Furthermore, traditional methods like leave-one-out cross-validation (LOO-CV) estimate the expected log predictive density (ELPD) of a model to investigate how the model generalises out-of-sample, which quickly becomes computationally inefficient when sample size becomes large. Here, we provide a tutorial on approximate Pareto-smoothed importance sampling leave-one-out cross-validation (PSIS-LOO), a computationally efficient method for Bayesian model comparison. First, we discuss several model views and the available Bayesian model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Statistical Methods in Clinical Trials