Extending the Scope of Inference About Predictive Ability to Machine Learning Methods
Juan Carlos Escanciano, Ricardo Parra

TL;DR
This paper extends classical inference methods for predictive ability to modern machine learning models in time series, addressing conditions for valid inference and demonstrating applications with simulations and empirical data.
Contribution
It develops conditions under which inference about predictive ability remains valid for machine learning methods like Lasso and Deep Learning in time series analysis.
Findings
Valid inference requires zero-mean score and fast convergence rate.
Sample splitting at 80%-20% improves inference accuracy.
Theoretical results are supported by simulations and empirical analysis.
Abstract
The use of machine learning methods for predictive purposes has increased dramatically over the past two decades, but uncertainty quantification for predictive comparisons remains elusive. This paper addresses this gap by extending the classic inference theory for predictive ability in time series to modern machine learners, such as the Lasso or Deep Learning. We investigate under which conditions such extensions are possible. For standard out-of-sample asymptotic inference to be valid with machine learning, two key properties must hold: (I) a zero-mean condition for the score of the prediction loss function and (ii) a "fast rate" of convergence for the machine learner. Absent any of these conditions, the estimation risk may be unbounded, and inferences invalid and very sensitive to sample splitting. For accurate inferences, we recommend an 80%-20% training-test splitting rule. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
