On the Geometry of Adversarial Examples
Marc Khoury, Dylan Hadfield-Menell

TL;DR
This paper introduces a geometric framework to analyze high-dimensional adversarial examples, revealing their relation to data manifold structure and decision boundary properties, and providing insights into robustness and training efficiency.
Contribution
It presents a novel geometric approach to understanding adversarial examples, including theoretical results on robustness tradeoffs and sampling conditions for classifiers.
Findings
Tradeoff between robustness under different norms
Adversarial training in balls is sample inefficient
Nearest neighbor classifiers can be robust under certain sampling conditions
Abstract
Adversarial examples are a pervasive phenomenon of machine learning models where seemingly imperceptible perturbations to the input lead to misclassifications for otherwise statistically accurate models. We propose a geometric framework, drawing on tools from the manifold reconstruction literature, to analyze the high-dimensional geometry of adversarial examples. In particular, we highlight the importance of codimension: for low-dimensional data manifolds embedded in high-dimensional space there are many directions off the manifold in which to construct adversarial examples. Adversarial examples are a natural consequence of learning a decision boundary that classifies the low-dimensional data manifold well, but classifies points near the manifold incorrectly. Using our geometric framework we prove (1) a tradeoff between robustness under different norms, (2) that adversarial training in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Cell Image Analysis Techniques
