The Statistical Cost of Robust Kernel Hyperparameter Tuning
Raphael A. Meyer, Christopher Musco

TL;DR
This paper analyzes the statistical complexity of tuning kernel hyperparameters in active regression with adversarial noise, showing that the additional sample complexity is only logarithmic for common kernels.
Contribution
It provides finite-sample guarantees for hyperparameter tuning, revealing that the complexity increase is minimal for typical kernel classes.
Findings
Hyperparameter tuning adds logarithmic sample complexity.
Finite-sample guarantees are established for kernel hyperparameter selection.
Results apply to common kernels like squared-exponential with unknown parameters.
Abstract
This paper studies the statistical complexity of kernel hyperparameter tuning in the setting of active regression under adversarial noise. We consider the problem of finding the best interpolant from a class of kernels with unknown hyperparameters, assuming only that the noise is square-integrable. We provide finite-sample guarantees for the problem, characterizing how increasing the complexity of the kernel class increases the complexity of learning kernel hyperparameters. For common kernel classes (e.g. squared-exponential kernels with unknown lengthscale), our results show that hyperparameter optimization increases sample complexity by just a logarithmic factor, in comparison to the setting where optimal parameters are known in advance. Our result is based on a subsampling guarantee for linear regression under multiple design matrices, combined with an {\epsilon}-net argument for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Machine Learning and Algorithms
MethodsLinear Regression
