Fast Computation of Leave-One-Out Cross-Validation for $k$-NN Regression
Motonobu Kanagawa

TL;DR
This paper introduces a computationally efficient method for leave-one-out cross-validation in $k$-NN regression, reducing the need for multiple model fits by leveraging a relationship with $(k+1)$-NN regression.
Contribution
The authors establish a novel theoretical link that allows LOOCV for $k$-NN to be computed using a single $(k+1)$-NN regression fit, significantly speeding up the process.
Findings
The method is validated through numerical experiments.
LOOCV score can be computed from a single $(k+1)$-NN fit.
The approach reduces computational complexity for $k$-NN regression.
Abstract
We describe a fast computation method for leave-one-out cross-validation (LOOCV) for -nearest neighbours (-NN) regression. We show that, under a tie-breaking condition for nearest neighbours, the LOOCV estimate of the mean square error for -NN regression is identical to the mean square error of -NN regression evaluated on the training data, multiplied by the scaling factor . Therefore, to compute the LOOCV score, one only needs to fit -NN regression only once, and does not need to repeat training-validation of -NN regression for the number of training data. Numerical experiments confirm the validity of the fast computation method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
