The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective
Nils Philipp Walter, Linara Adilova, Jilles Vreeken, Michael Kamp

TL;DR
This paper investigates the relationship between loss surface flatness and adversarial robustness, revealing a peculiar 'uncanny valley' phenomenon during attacks and emphasizing the need for combined flatness and Lipschitz constraints for robustness.
Contribution
It empirically analyzes the flatness-adversarial robustness relationship, uncovers the 'uncanny valley' phenomenon, and theoretically links flatness with the third derivative and Lipschitz constant.
Findings
Adversarial examples exhibit a sharp-to-flat transition during attacks.
Flatness alone does not ensure robustness without additional constraints.
The phenomenon is consistent across various models and datasets, including LLMs.
Abstract
Flatness of the loss surface not only correlates positively with generalization, but is also related to adversarial robustness since perturbations of inputs relate non-linearly to perturbations of weights. In this paper, we empirically analyze the relation between adversarial examples and relative flatness with respect to the parameters of one layer. We observe a peculiar property of adversarial examples in the context of relative flatness: during an iterative first-order white-box attack, the flatness of the loss surface measured around the adversarial example first becomes sharper until the label is flipped, but if we keep the attack running, it runs into a flat uncanny valley where the label remains flipped. In extensive experiments, we observe this phenomenon across various model architectures and datasets, even for adversarially trained models. Our results also extend to large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigh-Velocity Impact and Material Behavior · Bacillus and Francisella bacterial research · Energetic Materials and Combustion
