A Boundary Tilting Persepective on the Phenomenon of Adversarial Examples
Thomas Tanay, Lewis Griffin

TL;DR
This paper introduces a boundary tilting perspective on adversarial examples, arguing they arise when decision boundaries are close to data manifolds, and shows regularization can mitigate the strongest adversarial attacks.
Contribution
It proposes a new geometric perspective on adversarial examples, emphasizing boundary tilting and introduces a taxonomy based on adversarial strength, challenging the linear explanation.
Findings
Adversarial strength relates to the deviation angle from the nearest centroid classifier.
Boundary tilting can increase adversarial strength independently of classification accuracy.
Proper regularization reduces the level of adversarial strength and prevents overfitting-related adversarial examples.
Abstract
Deep neural networks have been shown to suffer from a surprising weakness: their classification outputs can be changed by small, non-random perturbations of their inputs. This adversarial example phenomenon has been explained as originating from deep networks being "too linear" (Goodfellow et al., 2014). We show here that the linear explanation of adversarial examples presents a number of limitations: the formal argument is not convincing, linear classifiers do not always suffer from the phenomenon, and when they do their adversarial examples are different from the ones affecting deep networks. We propose a new perspective on the phenomenon. We argue that adversarial examples exist when the classification boundary lies close to the submanifold of sampled data, and present a mathematical analysis of this new perspective in the linear case. We define the notion of adversarial strength…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)
