Intriguing Properties of Adversarial Examples
Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le

TL;DR
This paper reveals that adversarial examples stem from an inherent uncertainty in neural network predictions, which is universal across architectures and datasets, and demonstrates how reducing entropy and architecture search can improve robustness.
Contribution
It identifies the fundamental source of adversarial vulnerability as prediction uncertainty and proposes methods to enhance robustness through entropy reduction and neural architecture search.
Findings
Adversarial error scales as a power-law with perturbation size.
Universal behavior of adversarial vulnerability across datasets and models.
Neural architecture search yields more robust models against attacks.
Abstract
It is becoming increasingly clear that many machine learning classifiers are vulnerable to adversarial examples. In attempting to explain the origin of adversarial examples, previous studies have typically focused on the fact that neural networks operate on high dimensional data, they overfit, or they are too linear. Here we argue that the origin of adversarial examples is primarily due to an inherent uncertainty that neural networks have about their predictions. We show that the functional form of this uncertainty is independent of architecture, dataset, and training protocol; and depends only on the statistics of the logit differences of the network, which do not change significantly during training. This leads to adversarial error having a universal scaling, as a power-law, with respect to the size of the adversarial perturbation. We show that this universality holds for a broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
