Learning Curves and Benign Overfitting of Spectral Algorithms in Large Dimensions
Weihao Lu, Qian Lin, Yingcun Xia, Dongming Huang

TL;DR
This paper characterizes the learning curve and benign overfitting of spectral algorithms in high dimensions, revealing three regimes and demonstrating benign overfitting occurs under certain conditions.
Contribution
It provides a comprehensive asymptotic analysis of spectral kernel methods across all regularization regimes in large dimensions, including the under-regularized and interpolation regimes.
Findings
Learning curve has three regimes: over-regularized, under-regularized, and interpolation.
Benign overfitting occurs in under-regularized and interpolation regimes for positive source smoothness.
Kernel learning curve can be approximated by a sequence model in the regularized regime.
Abstract
Existing large-dimensional theory for spectral algorithms resolves either the optimally tuned point or the interpolation limit, but leaves the under-regularized regime unexplored. We study the learning curve and benign overfitting of spectral algorithms in the large-dimensional setting where the sample size and dimension are of comparable order, i.e., for some . We first consider inner-product kernels on the sphere and establish a sharp asymptotic characterization of the excess risk across the full regularization path under various source conditions , where measures the relative smoothness of the regression function. Our results reveal that the learning curve is not simply U-shaped but instead consists of three distinct regimes: over-regularized, under-regularized, and interpolation regimes. This characterization allows us…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
