On Neighbourhood Cross Validation
Simon N. Wood

TL;DR
This paper introduces an efficient method for computing and optimizing neighborhood cross validation criteria, enabling practical use in models with short-range autocorrelation and reducing computational costs significantly.
Contribution
It presents a novel approach to efficiently compute and optimize neighborhood cross validation for penalized regression models, addressing autocorrelation issues and reducing computational costs.
Findings
Enables practical use of leave-out-neighbourhood cross validation.
Reduces hyperparameter estimation cost to comparable to a single model fit.
Provides accurate uncertainty quantification despite autocorrelation.
Abstract
Many varieties of cross validation would be statistically appealing for the estimation of smoothing and other penalized regression hyperparameters, were it not for the high cost of evaluating such criteria. Here it is shown how to efficiently and accurately compute and optimize a broad variety of cross validation criteria for a wide range of models estimated by minimizing a quadratically penalized loss. The leading order computational cost of hyperparameter estimation is made comparable to the cost of a single model fit given hyperparameters. In many cases this represents an computational saving when modelling data. This development makes if feasible, for the first time, to use leave-out-neighbourhood cross validation to deal with the wide spread problem of un-modelled short range autocorrelation which otherwise leads to underestimation of smoothing parameters. It is also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making
