Exploiting Excessive Invariance caused by Norm-Bounded Adversarial   Robustness

J\"orn-Henrik Jacobsen; Jens Behrmannn; Nicholas Carlini and; Florian Tram\`er; Nicolas Papernot

arXiv:1903.10484·cs.LG·March 26, 2019·19 cites

Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness

J\"orn-Henrik Jacobsen, Jens Behrmannn, Nicholas Carlini and, Florian Tram\`er, Nicolas Papernot

PDF

Open Access

TL;DR

This paper reveals that robustness to perturbation-based adversarial examples can increase vulnerability to invariance-based adversarial examples, highlighting a need for more comprehensive robustness definitions.

Contribution

It demonstrates that current $ ext{l}_p$-norm robustness can cause excessive invariance, making models more susceptible to invariance-based adversarial attacks, and calls for precise adversarial definitions.

Findings

01

Robust models are more vulnerable to invariance-based adversarial examples.

02

Invariance-based attacks can find adversarial examples within the $ ext{l}_p$ ball.

03

Current robustness methods may inadvertently increase model vulnerability.

Abstract

Adversarial examples are malicious inputs crafted to cause a model to misclassify them. Their most common instantiation, "perturbation-based" adversarial examples introduce changes to the input that leave its true label unchanged, yet result in a different model prediction. Conversely, "invariance-based" adversarial examples insert changes to the input that leave the model's prediction unaffected despite the underlying input's label having changed. In this paper, we demonstrate that robustness to perturbation-based adversarial examples is not only insufficient for general robustness, but worse, it can also increase vulnerability of the model to invariance-based adversarial examples. In addition to analytical constructions, we empirically study vision classifiers with state-of-the-art robustness to perturbation-based adversaries constrained by an $ℓ_{p}$ norm. We mount attacks that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research