Comparison of Bayesian predictive methods for model selection
Juho Piironen, Aki Vehtari

TL;DR
This paper compares Bayesian model selection methods for regression and classification, highlighting the advantages of Bayesian model averaging and projection methods over traditional CV-based selection, especially with limited data.
Contribution
It provides a comprehensive comparison of Bayesian model selection techniques, emphasizing the robustness of Bayesian model averaging and projection methods in practical scenarios.
Findings
Bayesian model averaging outperforms other methods in predictive accuracy.
Projection methods reduce overfitting compared to CV-based selection.
Using cross-validation outside the search improves model assessment.
Abstract
The goal of this paper is to compare several widely used Bayesian model selection methods in practical model selection problems, highlight their differences and give recommendations about the preferred approaches. We focus on the variable subset selection for regression and classification and perform several numerical experiments using both simulated and real world data. The results show that the optimization of a utility estimate such as the cross-validation (CV) score is liable to finding overfitted models due to relatively high variance in the utility estimates when the data is scarce. This can also lead to substantial selection induced bias and optimism in the performance evaluation for the selected model. From a predictive viewpoint, best results are obtained by accounting for model uncertainty by forming the full encompassing model, such as the Bayesian model averaging solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
