Mind the spikes: Benign overfitting of kernels and neural networks in fixed dimension
Moritz Haas, David Holzm\"uller, Ulrike von Luxburg, Ingo Steinwart

TL;DR
This paper investigates benign overfitting in fixed-dimensional settings, showing that estimator smoothness, rather than dimension, determines overfitting potential, and demonstrates how neural networks can be modified to overfit benignly.
Contribution
It establishes that benign overfitting depends on estimator derivatives in fixed dimension and introduces modifications to neural networks to achieve benign overfitting.
Findings
Benign overfitting is possible with large derivatives in fixed dimension.
Neural tangent kernels can be adjusted with high-frequency fluctuations to enable benign overfitting.
Experiments confirm neural networks can generalize well despite overfitting in low-dimensional data.
Abstract
The success of over-parameterized neural networks trained to near-zero training error has caused great interest in the phenomenon of benign overfitting, where estimators are statistically consistent even though they interpolate noisy training data. While benign overfitting in fixed dimension has been established for some learning methods, current literature suggests that for regression with typical kernel methods and wide neural networks, benign overfitting requires a high-dimensional setting where the dimension grows with the sample size. In this paper, we show that the smoothness of the estimators, and not the dimension, is the key: benign overfitting is possible if and only if the estimator's derivatives are large enough. We generalize existing inconsistency results to non-interpolating models and more kernels to show that benign overfitting with moderate derivatives is impossible in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Machine Learning in Materials Science · Model Reduction and Neural Networks
