A Note on the Finite Sample Bias in Time Series Cross-Validation
Amaze Lusompa

TL;DR
This paper demonstrates that cross-validation for time series models, including VARs and martingale-like errors, inherently contains bias, challenging assumptions of bias-free model selection in such contexts.
Contribution
It provides a theoretical analysis showing that cross-validation bias persists in time series models, even with structures previously thought to mitigate it.
Findings
Cross-validation introduces bias in time series model selection.
Bias persists even with VAR and martingale error structures.
Implications for model evaluation practices in time series analysis.
Abstract
It is well known that model selection via cross validation can be biased for time series models. However, many researchers have argued that this bias does not apply when using cross-validation with vector autoregressions (VAR) or with time series models whose errors follow a martingale-like structure. I show that even under these circumstances, performing cross-validation on time series data will still generate bias in general.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Monetary Policy and Economic Impact · Financial Risk and Volatility Modeling
