Localized Uncertainty Attacks
Ousmane Amadou Dia, Theofanis Karaletsos, Caner Hazirbas, Cristian, Canton Ferrer, Ilknur Kaynar Kabul, Erik Meijer

TL;DR
This paper introduces localized uncertainty attacks that generate adversarial examples by perturbing only uncertain regions of inputs, making attacks less perceptible while remaining effective against classifiers.
Contribution
It proposes a new threat model focusing on perturbing uncertain regions, utilizing classifier uncertainty or surrogate models, to craft more subtle adversarial examples.
Findings
Effective against both deterministic and stochastic classifiers
Produces less perceptible adversarial examples
Maintains high attack success rates
Abstract
The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention in adversarial examples resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic and stochastic classifiers. Under this threat model, we create adversarial examples by perturbing only regions in the inputs where a classifier is uncertain. To find such regions, we utilize the predictive uncertainty of the classifier when the classifier is stochastic or, we learn a surrogate model to amortize the uncertainty when it is deterministic. Unlike ball or functional attacks which perturb inputs indiscriminately, our targeted changes can be less perceptible. When considered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
