Optimal Rate of Kernel Regression in Large Dimensions
Weihao Lu, Haobo Zhang, Yicheng Li, Manyun Xu, Qian Lin

TL;DR
This paper analyzes the optimal convergence rates of kernel regression in high-dimensional settings, revealing new phenomena like multiple descent and periodic plateau behaviors, with implications for neural networks and the neural tangent kernel.
Contribution
It introduces a general framework to characterize minimax bounds for kernel regression in large dimensions and identifies the precise rates and phenomena across different sample size regimes.
Findings
Minimax rate of $n^{-1/2}$ for $ ext{γ} = 2, 4, 6, 8, ext{etc.}$
Discovery of multiple descent and periodic plateau behaviors in the optimal rate curve
Explicit description of the optimal rate curve for the neural tangent kernel (NTK)
Abstract
We perform a study on kernel regression for large-dimensional data (where the sample size is polynomially depending on the dimension of the samples, i.e., for some ). We first build a general tool to characterize the upper bound and the minimax lower bound of kernel regression for large dimensional data through the Mendelson complexity and the metric entropy respectively. When the target function falls into the RKHS associated with a (general) inner product model defined on , we utilize the new tool to show that the minimax rate of the excess risk of kernel regression is when for . We then further determine the optimal rate of the excess risk of kernel regression for all the and find that the curve of optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Mathematical Approximation and Integration
