Learning Curves and Benign Overfitting of Spectral Algorithms in Large Dimensions

Weihao Lu; Qian Lin; Yingcun Xia; Dongming Huang

arXiv:2604.23212·stat.ML·April 28, 2026

Learning Curves and Benign Overfitting of Spectral Algorithms in Large Dimensions

Weihao Lu, Qian Lin, Yingcun Xia, Dongming Huang

PDF

TL;DR

This paper characterizes the learning curve and benign overfitting of spectral algorithms in high dimensions, revealing three regimes and demonstrating benign overfitting occurs under certain conditions.

Contribution

It provides a comprehensive asymptotic analysis of spectral kernel methods across all regularization regimes in large dimensions, including the under-regularized and interpolation regimes.

Findings

01

Learning curve has three regimes: over-regularized, under-regularized, and interpolation.

02

Benign overfitting occurs in under-regularized and interpolation regimes for positive source smoothness.

03

Kernel learning curve can be approximated by a sequence model in the regularized regime.

Abstract

Existing large-dimensional theory for spectral algorithms resolves either the optimally tuned point or the interpolation limit, but leaves the under-regularized regime unexplored. We study the learning curve and benign overfitting of spectral algorithms in the large-dimensional setting where the sample size and dimension are of comparable order, i.e., $n ≍ d^{γ}$ for some $γ > 0$ . We first consider inner-product kernels on the sphere $S^{d - 1}$ and establish a sharp asymptotic characterization of the excess risk across the full regularization path under various source conditions $s \geq 0$ , where $s$ measures the relative smoothness of the regression function. Our results reveal that the learning curve is not simply U-shaped but instead consists of three distinct regimes: over-regularized, under-regularized, and interpolation regimes. This characterization allows us…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.