A Universal Law of Robustness via Isoperimetry

S\'ebastien Bubeck; Mark Sellke

arXiv:2105.12806·cs.LG·December 27, 2022

A Universal Law of Robustness via Isoperimetry

S\'ebastien Bubeck, Mark Sellke

PDF

1 Video

TL;DR

This paper establishes a universal law of robustness in data interpolation, showing that overparametrization is necessary for smooth interpolation across broad data distributions, with implications for understanding neural network generalization.

Contribution

It proves that smooth data interpolation requires significantly more parameters than mere interpolation, generalizing previous conjectures and linking robustness to isoperimetry.

Findings

01

Overparametrization is necessary for smooth interpolation in broad settings.

02

The universal law applies to various data distributions with isoperimetry.

03

Provides an improved generalization bound for smooth function classes.

Abstract

Classically, data interpolation with a parametrized model class is possible as long as the number of parameters is larger than the number of equations to be satisfied. A puzzling phenomenon in deep learning is that models are trained with many more parameters than what this classical theory would suggest. We propose a partial theoretical explanation for this phenomenon. We prove that for a broad class of data distributions and model classes, overparametrization is necessary if one wants to interpolate the data smoothly. Namely we show that smooth interpolation requires $d$ times more parameters than mere interpolation, where $d$ is the ambient data dimension. We prove this universal law of robustness for any smoothly parametrized function class with polynomial size weights, and any covariate distribution verifying isoperimetry. In the case of two-layers neural networks and Gaussian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Universal Law of Robustness via Isoperimetry· slideslive