A Law of Robustness for Weight-bounded Neural Networks
Hisham Husain, Borja Balle

TL;DR
This paper establishes a fundamental lower bound on the Lipschitz constant for neural networks with bounded weights, demonstrating the necessity of over-parameterization for robustness against adversarial attacks.
Contribution
It derives a general lower bound on the Lipschitz constant for any model class with bounded Rademacher complexity, extending previous conjectures to multi-layer networks.
Findings
Lower bound matches conjecture for two-layer networks with bounded weights.
Multi-layer networks require log n constant-sized layers for robust fitting.
Over-parameterization is necessary for robustness in deep neural networks.
Abstract
Robustness of deep neural networks against adversarial perturbations is a pressing concern motivated by recent findings showing the pervasive nature of such vulnerabilities. One method of characterizing the robustness of a neural network model is through its Lipschitz constant, which forms a robustness certificate. A natural question to ask is, for a fixed model class (such as neural networks) and a dataset of size , what is the smallest achievable Lipschitz constant among all models that fit the dataset? Recently, (Bubeck et al., 2020) conjectured that when using two-layer networks with neurons to fit a generic dataset, the smallest Lipschitz constant is . This implies that one would require one neuron per data point to robustly fit the data. In this work we derive a lower bound on the Lipschitz constant for any arbitrary model class with bounded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
