Benign Overfitting and Noisy Features

Zhu Li; Weijie Su; Dino Sejdinovic

arXiv:2008.02901·stat.ML·February 8, 2021·6 cites

Benign Overfitting and Noisy Features

Zhu Li, Weijie Su, Dino Sejdinovic

PDF

Open Access

TL;DR

This paper investigates benign overfitting in random feature models, revealing that noise in features plays a crucial regularization role enabling models to generalize well despite overparameterization.

Contribution

It provides a new perspective on benign overfitting by highlighting the role of noise in features within random neural network models.

Findings

01

Benign overfitting occurs due to noise in features.

02

Noise in features acts as an implicit regularizer.

03

The phenomenon is analyzed in two-layer neural networks with fixed first layer weights.

Abstract

Modern machine learning often operates in the regime where the number of parameters is much higher than the number of data points, with zero training loss and yet good generalization, thereby contradicting the classical bias-variance trade-off. This \textit{benign overfitting} phenomenon has recently been characterized using so called \textit{double descent} curves where the risk undergoes another descent (in addition to the classical U-shaped learning curve when the number of parameters is small) as we increase the number of parameters beyond a certain threshold. In this paper, we examine the conditions under which \textit{Benign Overfitting} occurs in the random feature (RF) models, i.e. in a two-layer neural network with fixed first layer weights. We adopt a new view of random feature and show that \textit{benign overfitting} arises due to the noise which resides in such features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Machine Learning and ELM