Adversarially Robust Generalization Requires More Data
Ludwig Schmidt, Shibani Santurkar, Dimitris Tsipras, Kunal Talwar and, Aleksander M\k{a}dry

TL;DR
This paper demonstrates that achieving adversarial robustness in machine learning models requires significantly more data than standard training, due to an inherent increase in sample complexity, supported by theoretical analysis and empirical evidence.
Contribution
It provides a theoretical and empirical analysis showing that robust learning demands more data, highlighting an intrinsic sample complexity gap independent of algorithms or models.
Findings
Robust learning has higher sample complexity than standard learning.
The sample complexity gap is information-theoretic and model-independent.
Experiments confirm the theoretical gap on real datasets.
Abstract
Machine learning models are often susceptible to adversarial perturbations of their inputs. Even small perturbations can cause state-of-the-art classifiers with high "standard" accuracy to produce an incorrect prediction with high confidence. To better understand this phenomenon, we study adversarially robust learning from the viewpoint of generalization. We show that already in a simple natural data model, the sample complexity of robust learning can be significantly larger than that of "standard" learning. This gap is information theoretic and holds irrespective of the training algorithm or the model family. We complement our theoretical results with experiments on popular image classification datasets and show that a similar gap exists here as well. We postulate that the difficulty of training robust classifiers stems, at least partially, from this inherently larger sample complexity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research
