Generalization error of minimum weighted norm and kernel interpolation

Weilin Li

arXiv:2008.03365·math.NA·February 11, 2021·1 cites

Generalization error of minimum weighted norm and kernel interpolation

Weilin Li

PDF

Open Access

TL;DR

This paper analyzes how minimum weighted norm and kernel interpolation methods generalize as model complexity increases, revealing their implicit bias and convergence properties through a deterministic approximation theory approach.

Contribution

It provides a rigorous theoretical framework for understanding the implicit bias and generalization of weighted norm interpolants, including special cases like trigonometric polynomials and spherical harmonics.

Findings

01

Interpolants converge to a RKHS as parameters grow

02

Generalization errors diminish with increasing parameters

03

The implicit bias explains over-parameterization effects

Abstract

We study the generalization error of functions that interpolate prescribed data points and are selected by minimizing a weighted norm. Under natural and general conditions, we prove that both the interpolants and their generalization errors converge as the number of parameters grow, and the limiting interpolant belongs to a reproducing kernel Hilbert space. This rigorously establishes an implicit bias of minimum weighted norm interpolation and explains why norm minimization may either benefit or suffer from over-parameterization. As special cases of this theory, we study interpolation by trigonometric polynomials and spherical harmonics. Our approach is from a deterministic and approximation theory viewpoint, as opposed to a statistical or random matrix one.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Matrix Theory and Algorithms