Verifying the Causes of Adversarial Examples
Honglin Li, Yifei Fan, Frieder Ganz, Anthony Yezzi, Payam Barnaghi

TL;DR
This paper investigates the underlying causes of adversarial examples in neural networks through controlled experiments, identifying geometry and statistical factors as key contributors to their formation.
Contribution
It systematically verifies potential causes of adversarial examples and introduces new techniques to control these effects, enhancing understanding of their origins.
Findings
Geometric factors are primary causes of adversarial examples.
Statistical factors amplify adversarial phenomena, especially at high confidence levels.
Controlled experiments validate the influence of model linearity and category geometry.
Abstract
The robustness of neural networks is challenged by adversarial examples that contain almost imperceptible perturbations to inputs, which mislead a classifier to incorrect outputs in high confidence. Limited by the extreme difficulty in examining a high-dimensional image space thoroughly, research on explaining and justifying the causes of adversarial examples falls behind studies on attacks and defenses. In this paper, we present a collection of potential causes of adversarial examples and verify (or partially verify) them through carefully-designed controlled experiments. The major causes of adversarial examples include model linearity, one-sum constraint, and geometry of the categories. To control the effect of those causes, multiple techniques are applied such as normalization, replacement of loss functions, construction of reference datasets, and novel models using multi-layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
