Adversarial Spheres

Justin Gilmer; Luke Metz; Fartash Faghri; Samuel S. Schoenholz,; Maithra Raghu; Martin Wattenberg; and Ian Goodfellow

arXiv:1801.02774·cs.CV·September 11, 2018·47 cites

Adversarial Spheres

Justin Gilmer, Luke Metz, Fartash Faghri, Samuel S. Schoenholz,, Maithra Raghu, Martin Wattenberg, and Ian Goodfellow

PDF

Open Access 2 Repos

TL;DR

This paper investigates the geometric reasons behind adversarial vulnerability in high-dimensional data, using a simple spherical dataset to demonstrate fundamental tradeoffs between error and robustness.

Contribution

It introduces a theoretical framework linking high-dimensional geometry to adversarial vulnerability, supported by analysis of a synthetic spherical dataset.

Findings

01

Any model misclassifying a small fraction of the sphere is vulnerable to small perturbations.

02

All tested architectures approach the theoretical vulnerability bound.

03

Vulnerability is a natural consequence of high-dimensional geometry and test error.

Abstract

State of the art computer vision models have been shown to be vulnerable to small adversarial perturbations of the input. In other words, most images in the data distribution are both correctly classified by the model and are very close to a visually similar misclassified image. Despite substantial research interest, the cause of the phenomenon is still poorly understood and remains unsolved. We hypothesize that this counter intuitive behavior is a naturally occurring result of the high dimensional geometry of the data manifold. As a first step towards exploring this hypothesis, we study a simple synthetic dataset of classifying between two concentric high dimensional spheres. For this dataset we show a fundamental tradeoff between the amount of test error and the average distance to nearest error. In particular, we prove that any model which misclassifies a small constant fraction of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques