On the Geometry of Adversarial Examples

Marc Khoury; Dylan Hadfield-Menell

arXiv:1811.00525·cs.LG·December 13, 2018·43 cites

On the Geometry of Adversarial Examples

Marc Khoury, Dylan Hadfield-Menell

PDF

Open Access

TL;DR

This paper introduces a geometric framework to analyze high-dimensional adversarial examples, revealing their relation to data manifold structure and decision boundary properties, and providing insights into robustness and training efficiency.

Contribution

It presents a novel geometric approach to understanding adversarial examples, including theoretical results on robustness tradeoffs and sampling conditions for classifiers.

Findings

01

Tradeoff between robustness under different norms

02

Adversarial training in balls is sample inefficient

03

Nearest neighbor classifiers can be robust under certain sampling conditions

Abstract

Adversarial examples are a pervasive phenomenon of machine learning models where seemingly imperceptible perturbations to the input lead to misclassifications for otherwise statistically accurate models. We propose a geometric framework, drawing on tools from the manifold reconstruction literature, to analyze the high-dimensional geometry of adversarial examples. In particular, we highlight the importance of codimension: for low-dimensional data manifolds embedded in high-dimensional space there are many directions off the manifold in which to construct adversarial examples. Adversarial examples are a natural consequence of learning a decision boundary that classifies the low-dimensional data manifold well, but classifies points near the manifold incorrectly. Using our geometric framework we prove (1) a tradeoff between robustness under different norms, (2) that adversarial training in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Cell Image Analysis Techniques