Kernel Alignment Risk Estimator: Risk Prediction from Training Data
Arthur Jacot, Berfin \c{S}im\c{s}ek, Francesco Spadaro, Cl\'ement, Hongler, Franck Gabriel

TL;DR
This paper introduces the Signal Capture Threshold and Kernel Alignment Risk Estimator to predict and approximate the generalization risk of Kernel Ridge Regression directly from training data, enabling better kernel and hyperparameter selection.
Contribution
It proposes the KARE and SCT as novel tools for risk prediction in KRR, providing a data-dependent method for kernel and hyperparameter selection based on training data.
Findings
KARE accurately approximates KRR risk on real datasets.
The approach supports kernel and hyperparameter comparison directly from training data.
Numerical experiments validate the universality assumption and effectiveness of the method.
Abstract
We study the risk (i.e. generalization error) of Kernel Ridge Regression (KRR) for a kernel with ridge and i.i.d. observations. For this, we introduce two objects: the Signal Capture Threshold (SCT) and the Kernel Alignment Risk Estimator (KARE). The SCT is a function of the data distribution: it can be used to identify the components of the data that the KRR predictor captures, and to approximate the (expected) KRR risk. This then leads to a KRR risk approximation by the KARE , an explicit function of the training data, agnostic of the true data distribution. We phrase the regression problem in a functional setting. The key results then follow from a finite-size analysis of the Stieltjes transform of general Wishart random matrices. Under a natural universality assumption (that the KRR moments depend asymptotically on the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Mechanics and Entropy · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
