Harmless interpolation in regression and classification with structured   features

Andrew D. McRae; Santhosh Karnik; Mark A. Davenport; Vidya; Muthukumar

arXiv:2111.05198·stat.ML·February 23, 2022

Harmless interpolation in regression and classification with structured features

Andrew D. McRae, Santhosh Karnik, Mark A. Davenport, Vidya, Muthukumar

PDF

Open Access

TL;DR

This paper develops a general framework to understand when overparameterized models with structured features, such as kernels, can interpolate noisy data without harming test performance, extending previous results beyond independent features.

Contribution

It introduces a flexible framework for analyzing harmless interpolation in structured feature spaces, including bounded orthonormal systems, and clarifies differences between classification and regression performance.

Findings

01

Harmless interpolation occurs under specific Gram matrix conditions.

02

Results extend prior independent-feature analyses to more general feature structures.

03

Asymptotic separation between classification and regression performance is demonstrated.

Abstract

Overparametrized neural networks tend to perfectly fit noisy training data yet generalize well on test data. Inspired by this empirical observation, recent work has sought to understand this phenomenon of benign overfitting or harmless interpolation in the much simpler linear model. Previous theoretical work critically assumes that either the data features are statistically independent or the input data is high-dimensional; this precludes general nonparametric settings with structured feature maps. In this paper, we present a general and flexible framework for upper bounding regression and classification risk in a reproducing kernel Hilbert space. A key contribution is that our framework describes precise sufficient conditions on the data Gram matrix under which harmless interpolation occurs. Our results recover prior independent-features results (with a much simpler analysis), but they…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Neural Networks and Applications