A non-asymptotic theory of Kernel Ridge Regression: deterministic equivalents, test error, and GCV estimator
Theodor Misiakiewicz, Basil Saeed

TL;DR
This paper provides a non-asymptotic, deterministic approximation for the test error of Kernel Ridge Regression (KRR) that depends only on the kernel spectrum and target function alignment, with theoretical guarantees and practical GCV estimator insights.
Contribution
It establishes a general non-asymptotic theory for KRR test error, relaxing previous restrictive assumptions and providing explicit bounds based on spectral properties.
Findings
Deterministic approximation for KRR test error with explicit bounds.
GCV estimator concentrates on test error over a range of regularization parameters.
Excellent agreement between theory and numerical simulations.
Abstract
We consider learning an unknown target function using kernel ridge regression (KRR) given i.i.d. data , , where is a covariate vector and . A recent string of work has empirically shown that the test error of KRR can be well approximated by a closed-form estimate derived from an `equivalent' sequence model that only depends on the spectrum of the kernel operator. However, a theoretical justification for this equivalence has so far relied either on restrictive assumptions -- such as subgaussian independent eigenfunctions -- , or asymptotic derivations for specific kernels in high dimensions. In this paper, we prove that this equivalence holds for a general class of problems satisfying some spectral and concentration properties on the kernel eigendecomposition. Specifically, we establish in this setting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification
