Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling
Avishek Joey Bose, Andre Cianflone, William L. Hamilton

TL;DR
This paper introduces a novel framework for adversarial attacks that models a distribution of perturbations, enabling diverse and domain-agnostic attacks across images, text, and graphs, with high success rates and attack generalization.
Contribution
It presents a new approach to generate a distribution of adversarial perturbations, improving diversity, domain adaptability, and zero-shot attack capabilities compared to traditional single-perturbation methods.
Findings
Achieves state-of-the-art attack success rates in graphs.
Generates diverse adversarial examples efficiently.
Effective across multiple domains: images, text, and graphs.
Abstract
Adversarial attacks on deep neural networks traditionally rely on a constrained optimization paradigm, where an optimization procedure is used to obtain a single adversarial perturbation for a given input example. In this work we frame the problem as learning a distribution of adversarial perturbations, enabling us to generate diverse adversarial distributions given an unperturbed input. We show that this framework is domain-agnostic in that the same framework can be employed to attack different input domains with minimal modification. Across three diverse domains---images, text, and graphs---our approach generates whitebox attacks with success rates that are competitive with or superior to existing approaches, with a new state-of-the-art achieved in the graph domain. Finally, we demonstrate that our framework can efficiently generate a diverse set of attacks for a single given input,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
