Kernel Alignment Risk Estimator: Risk Prediction from Training Data

Arthur Jacot; Berfin \c{S}im\c{s}ek; Francesco Spadaro; Cl\'ement; Hongler; Franck Gabriel

arXiv:2006.09796·stat.ML·June 18, 2020·23 cites

Kernel Alignment Risk Estimator: Risk Prediction from Training Data

Arthur Jacot, Berfin \c{S}im\c{s}ek, Francesco Spadaro, Cl\'ement, Hongler, Franck Gabriel

PDF

Open Access 1 Video

TL;DR

This paper introduces the Signal Capture Threshold and Kernel Alignment Risk Estimator to predict and approximate the generalization risk of Kernel Ridge Regression directly from training data, enabling better kernel and hyperparameter selection.

Contribution

It proposes the KARE and SCT as novel tools for risk prediction in KRR, providing a data-dependent method for kernel and hyperparameter selection based on training data.

Findings

01

KARE accurately approximates KRR risk on real datasets.

02

The approach supports kernel and hyperparameter comparison directly from training data.

03

Numerical experiments validate the universality assumption and effectiveness of the method.

Abstract

We study the risk (i.e. generalization error) of Kernel Ridge Regression (KRR) for a kernel $K$ with ridge $λ > 0$ and i.i.d. observations. For this, we introduce two objects: the Signal Capture Threshold (SCT) and the Kernel Alignment Risk Estimator (KARE). The SCT $ϑ_{K, λ}$ is a function of the data distribution: it can be used to identify the components of the data that the KRR predictor captures, and to approximate the (expected) KRR risk. This then leads to a KRR risk approximation by the KARE $ρ_{K, λ}$ , an explicit function of the training data, agnostic of the true data distribution. We phrase the regression problem in a functional setting. The key results then follow from a finite-size analysis of the Stieltjes transform of general Wishart random matrices. Under a natural universality assumption (that the KRR moments depend asymptotically on the first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Kernel Alignment Risk Estimator: Risk Prediction from Training Data· slideslive

Taxonomy

TopicsStatistical Mechanics and Entropy · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference