Adaptive sequential Monte Carlo for structured cross validation in Bayesian hierarchical models
Geonhee Han, Andrew Gelman

TL;DR
This paper introduces an adaptive sequential Monte Carlo method tailored for structured cross validation in Bayesian hierarchical models, reducing the need for costly re-estimation and accommodating diverse CV schemes.
Contribution
It develops a novel adaptive SMC approach that efficiently traces posterior paths for case-deletion in hierarchical models, automating and streamlining cross validation workflows.
Findings
Effective in longitudinal leave-group-out CV
Supports diverse CV designs including K-fold and sequential validation
Reduces MCMC re-runs and computational costs
Abstract
Importance sampling (IS) is commonly used for cross validation (CV) in Bayesian models, because it only involves reweighting existing posterior draws without needing to re-estimate the model by re-running Markov chain Monte Carlo (MCMC). For hierarchical models, standard IS can be unreliable; the out-of-sample generalization hypothesis may involve structured case-deletion schemes which significantly alter the posterior geometry. This can force costly MCMC re-runs and make CV impractical. As a principled alternative, we tailor adaptive sequential Monte Carlo to sample along a path of posteriors that leads to the case-deleted posterior. The sampler is designed to support various hypotheses by accommodating diverse CV designs, and to streamline the workflow by automating path construction and systematically minimizing MCMC intervention. We demonstrate its utility with three types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Statistical Methods and Inference
