A non-asymptotic theory of Kernel Ridge Regression: deterministic   equivalents, test error, and GCV estimator

Theodor Misiakiewicz; Basil Saeed

arXiv:2403.08938·stat.ML·March 15, 2024·1 cites

A non-asymptotic theory of Kernel Ridge Regression: deterministic equivalents, test error, and GCV estimator

Theodor Misiakiewicz, Basil Saeed

PDF

Open Access

TL;DR

This paper provides a non-asymptotic, deterministic approximation for the test error of Kernel Ridge Regression (KRR) that depends only on the kernel spectrum and target function alignment, with theoretical guarantees and practical GCV estimator insights.

Contribution

It establishes a general non-asymptotic theory for KRR test error, relaxing previous restrictive assumptions and providing explicit bounds based on spectral properties.

Findings

01

Deterministic approximation for KRR test error with explicit bounds.

02

GCV estimator concentrates on test error over a range of regularization parameters.

03

Excellent agreement between theory and numerical simulations.

Abstract

We consider learning an unknown target function $f_{*}$ using kernel ridge regression (KRR) given i.i.d. data $(u_{i}, y_{i})$ , $i \leq n$ , where $u_{i} \in U$ is a covariate vector and $y_{i} = f_{*} (u_{i}) + ε_{i} \in R$ . A recent string of work has empirically shown that the test error of KRR can be well approximated by a closed-form estimate derived from an `equivalent' sequence model that only depends on the spectrum of the kernel operator. However, a theoretical justification for this equivalence has so far relied either on restrictive assumptions -- such as subgaussian independent eigenfunctions -- , or asymptotic derivations for specific kernels in high dimensions. In this paper, we prove that this equivalence holds for a general class of problems satisfying some spectral and concentration properties on the kernel eigendecomposition. Specifically, we establish in this setting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsControl Systems and Identification