Cross-validatory model selection for Bayesian autoregressions with exogenous regressors
Alex Cooper, Dan Simpson, Lauren Kennedy, Catherine Forbes, Aki, Vehtari

TL;DR
This paper investigates Bayesian cross-validation methods for autoregressive models with exogenous regressors, highlighting how design choices affect model selection accuracy in finite samples and proposing improved scoring strategies.
Contribution
It demonstrates that joint density scores outperform pointwise scores in finite samples for ARX models and provides finite-sample distribution results and practical recommendations.
Findings
Joint density scores have lower variance than pointwise scores.
Proper CV design reduces adverse selection in model choice.
Finite-sample distributions of CV estimators are derived for ARX models.
Abstract
Bayesian cross-validation (CV) is a popular method for predictive model assessment that is simple to implement and broadly applicable. A wide range of CV schemes is available for time series applications, including generic leave-one-out (LOO) and K-fold methods, as well as specialized approaches intended to deal with serial dependence such as leave-future-out (LFO), h-block, and hv-block. Existing large-sample results show that both specialized and generic methods are applicable to models of serially-dependent data. However, large sample consistency results overlook the impact of sampling variability on accuracy in finite samples. Moreover, the accuracy of a CV scheme depends on many aspects of the procedure. We show that poor design choices can lead to elevated rates of adverse selection. In this paper, we consider the problem of identifying the regression component of an important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Advanced Statistical Process Monitoring · Statistical Methods and Bayesian Inference
