Convergence rates of Kernel Conjugate Gradient for random design regression

Gilles Blanchard, Nicole Krämer

arXiv:1607.02387 · math.ST · July 11, 2016

TL;DR

This paper establishes convergence rates for kernel conjugate gradient regression with early stopping, showing near-optimal performance depending on the target function's regularity and data complexity.

Contribution

It provides the first statistical convergence rates for kernel conjugate gradient regression with early stopping, matching minimax bounds under various conditions.

Findings

01. Convergence rates depend on target function regularity and data complexity.

02. Rates match known minimax lower bounds for prediction and Hilbert norms.

03. Additional unlabeled data improve convergence when the true function is outside the RKHS.

Abstract

We prove statistical rates of convergence for kernel-based least squares regression from i.i.d. data using a conjugate gradient algorithm, where regularization against overfitting is obtained by early stopping. This method is related to Kernel Partial Least Squares, a regression method that combines supervised dimensionality reduction with least squares projection. Following the setting introduced in earlier related literature, we study so-called "fast convergence rates" depending on the regularity of the target regression function (measured by a source condition in terms of the kernel integral operator) and on the effective dimensionality of the data mapped into the kernel space. We obtain upper bounds, essentially matching known minimax lower bounds, for the $L^{2}$ (prediction) norm as well as for the stronger Hilbert norm, if the true regression function belongs to the…
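
To make the idea of regularization by early stopping concrete, here is a minimal sketch. It is not the authors' algorithm: it runs plain conjugate gradient on the linear system K α = y, which may differ from the exact kernel CG variant analyzed in the paper, and it chooses the stopping iteration by hold-out validation, which is an assumption made for illustration rather than the paper's stopping rule. The Gaussian kernel, the bandwidth, and all function names (`gaussian_kernel`, `kernel_cg`, `pick_stopping_iteration`) are hypothetical choices.

```python
import numpy as np


def gaussian_kernel(X, Z, bandwidth=0.5):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Z."""
    sq_dist = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Z**2, axis=1)[None, :]
        - 2.0 * X @ Z.T
    )
    # Clip tiny negative values caused by floating-point cancellation.
    return np.exp(-np.maximum(sq_dist, 0.0) / (2.0 * bandwidth**2))


def kernel_cg(K, y, n_iter):
    """Run at most n_iter steps of plain CG on K @ alpha = y, starting at zero.

    The iteration count n_iter plays the role of the regularization parameter.
    """
    alpha = np.zeros(len(y), dtype=float)
    r = np.asarray(y, dtype=float).copy()  # residual y - K @ alpha
    p = r.copy()                           # current search direction
    rs = r @ r
    for _ in range(n_iter):
        if rs <= 1e-20:          # residual already negligible
            break
        Kp = K @ p
        curvature = p @ Kp
        if curvature <= 1e-14:   # direction (numerically) in the null space
            break
        step = rs / curvature
        alpha += step * p
        r -= step * Kp
        rs_new = r @ r
        p = r + (rs_new / rs) * p
        rs = rs_new
    return alpha


def pick_stopping_iteration(K_train, y_train, K_val_train, y_val, max_iter):
    """Return the iteration count in 1..max_iter with the smallest hold-out error.

    Assumption: hold-out validation stands in for a principled stopping rule.
    """
    errors = []
    for m in range(1, max_iter + 1):
        alpha = kernel_cg(K_train, y_train, m)
        errors.append(float(np.mean((K_val_train @ alpha - y_val) ** 2)))
    return int(np.argmin(errors)) + 1, errors


if __name__ == "__main__":
    # Synthetic i.i.d. data: noisy samples of a smooth function (illustrative only).
    rng = np.random.default_rng(0)
    X = rng.uniform(-1.0, 1.0, size=(200, 1))
    y = np.sin(3.0 * X[:, 0]) + 0.3 * rng.standard_normal(200)
    X_tr, y_tr, X_val, y_val = X[:150], y[:150], X[150:], y[150:]

    K_tr = gaussian_kernel(X_tr, X_tr)
    K_val_tr = gaussian_kernel(X_val, X_tr)
    best_m, errors = pick_stopping_iteration(K_tr, y_tr, K_val_tr, y_val, max_iter=30)
    print("selected stopping iteration:", best_m)
    print("hold-out MSE at that iteration:", errors[best_m - 1])
```

Each CG step enlarges the Krylov subspace in which the coefficient vector is sought, so running fewer iterations restricts the fit to a lower-dimensional, data-dependent subspace. That is the sense in which the iteration count acts as the regularization parameter in this sketch.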
