Adversarially Robust Generalization Requires More Data

Ludwig Schmidt; Shibani Santurkar; Dimitris Tsipras; Kunal Talwar and; Aleksander M\k{a}dry

arXiv:1804.11285·cs.LG·May 3, 2018·359 cites

Adversarially Robust Generalization Requires More Data

Ludwig Schmidt, Shibani Santurkar, Dimitris Tsipras, Kunal Talwar and, Aleksander M\k{a}dry

PDF

Open Access

TL;DR

This paper demonstrates that achieving adversarial robustness in machine learning models requires significantly more data than standard training, due to an inherent increase in sample complexity, supported by theoretical analysis and empirical evidence.

Contribution

It provides a theoretical and empirical analysis showing that robust learning demands more data, highlighting an intrinsic sample complexity gap independent of algorithms or models.

Findings

01

Robust learning has higher sample complexity than standard learning.

02

The sample complexity gap is information-theoretic and model-independent.

03

Experiments confirm the theoretical gap on real datasets.

Abstract

Machine learning models are often susceptible to adversarial perturbations of their inputs. Even small perturbations can cause state-of-the-art classifiers with high "standard" accuracy to produce an incorrect prediction with high confidence. To better understand this phenomenon, we study adversarially robust learning from the viewpoint of generalization. We show that already in a simple natural data model, the sample complexity of robust learning can be significantly larger than that of "standard" learning. This gap is information theoretic and holds irrespective of the training algorithm or the model family. We complement our theoretical results with experiments on popular image classification datasets and show that a similar gap exists here as well. We postulate that the difficulty of training robust classifiers stems, at least partially, from this inherently larger sample complexity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research