Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness
J\"orn-Henrik Jacobsen, Jens Behrmannn, Nicholas Carlini and, Florian Tram\`er, Nicolas Papernot

TL;DR
This paper reveals that robustness to perturbation-based adversarial examples can increase vulnerability to invariance-based adversarial examples, highlighting a need for more comprehensive robustness definitions.
Contribution
It demonstrates that current $ ext{l}_p$-norm robustness can cause excessive invariance, making models more susceptible to invariance-based adversarial attacks, and calls for precise adversarial definitions.
Findings
Robust models are more vulnerable to invariance-based adversarial examples.
Invariance-based attacks can find adversarial examples within the $ ext{l}_p$ ball.
Current robustness methods may inadvertently increase model vulnerability.
Abstract
Adversarial examples are malicious inputs crafted to cause a model to misclassify them. Their most common instantiation, "perturbation-based" adversarial examples introduce changes to the input that leave its true label unchanged, yet result in a different model prediction. Conversely, "invariance-based" adversarial examples insert changes to the input that leave the model's prediction unaffected despite the underlying input's label having changed. In this paper, we demonstrate that robustness to perturbation-based adversarial examples is not only insufficient for general robustness, but worse, it can also increase vulnerability of the model to invariance-based adversarial examples. In addition to analytical constructions, we empirically study vision classifiers with state-of-the-art robustness to perturbation-based adversaries constrained by an norm. We mount attacks that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research
