Minimum discrepancy principle strategy for choosing $k$ in $k$-NN regression
Yaroslav Averyanov, Alain Celisse

TL;DR
This paper introduces a new data-driven method based on the minimum discrepancy principle for selecting the hyperparameter $k$ in $k$-NN regression, which is computationally efficient and statistically optimal.
Contribution
The paper proposes a novel early stopping-based strategy for choosing $k$ in $k$-NN regression that is both minimax-optimal and computationally more efficient than existing methods.
Findings
Method improves statistical performance over traditional strategies.
Strategy reduces computational time significantly.
Proven minimax-optimal over certain smoothness classes.
Abstract
We present a novel data-driven strategy to choose the hyperparameter in the -NN regression estimator without using any hold-out data. We treat the problem of choosing the hyperparameter as an iterative procedure (over ) and propose using an easily implemented in practice strategy based on the idea of early stopping and the minimum discrepancy principle. This model selection strategy is proven to be minimax-optimal over some smoothness function classes, for instance, the Lipschitz functions class on a bounded domain. The novel method often improves statistical performance on artificial and real-world data sets in comparison to other model selection strategies, such as the Hold-out method, 5-fold cross-validation, and AIC criterion. The novelty of the strategy comes from reducing the computational time of the model selection procedure while preserving the statistical (minimax)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Advanced Statistical Process Monitoring
MethodsEarly Stopping
