Intriguing Properties of Adversarial Examples

Ekin D. Cubuk; Barret Zoph; Samuel S. Schoenholz; Quoc V. Le

arXiv:1711.02846·stat.ML·November 9, 2017·23 cites

Intriguing Properties of Adversarial Examples

Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le

PDF

Open Access

TL;DR

This paper reveals that adversarial examples stem from an inherent uncertainty in neural network predictions, which is universal across architectures and datasets, and demonstrates how reducing entropy and architecture search can improve robustness.

Contribution

It identifies the fundamental source of adversarial vulnerability as prediction uncertainty and proposes methods to enhance robustness through entropy reduction and neural architecture search.

Findings

01

Adversarial error scales as a power-law with perturbation size.

02

Universal behavior of adversarial vulnerability across datasets and models.

03

Neural architecture search yields more robust models against attacks.

Abstract

It is becoming increasingly clear that many machine learning classifiers are vulnerable to adversarial examples. In attempting to explain the origin of adversarial examples, previous studies have typically focused on the fact that neural networks operate on high dimensional data, they overfit, or they are too linear. Here we argue that the origin of adversarial examples is primarily due to an inherent uncertainty that neural networks have about their predictions. We show that the functional form of this uncertainty is independent of architecture, dataset, and training protocol; and depends only on the statistics of the logit differences of the network, which do not change significantly during training. This leads to adversarial error having a universal scaling, as a power-law, with respect to the size of the adversarial perturbation. We show that this universality holds for a broad…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications