Diverse Generative Perturbations on Attention Space for Transferable   Adversarial Attacks

Woo Jae Kim; Seunghoon Hong; and Sung-Eui Yoon

arXiv:2208.05650·cs.CV·December 5, 2022

Diverse Generative Perturbations on Attention Space for Transferable Adversarial Attacks

Woo Jae Kim, Seunghoon Hong, and Sung-Eui Yoon

PDF

Open Access 1 Repo

TL;DR

This paper introduces ADA, a stochastic adversarial attack method that perturbs attention features diversely to enhance transferability across models, outperforming existing techniques.

Contribution

The paper proposes a novel stochastic perturbation approach disrupting attention features to improve transferability of adversarial attacks.

Findings

01

ADA outperforms state-of-the-art transferability methods.

02

Stochastic perturbations explore the loss surface more effectively.

03

Disrupting attention features enhances attack transferability.

Abstract

Adversarial attacks with improved transferability - the ability of an adversarial example crafted on a known model to also fool unknown models - have recently received much attention due to their practicality. Nevertheless, existing transferable attacks craft perturbations in a deterministic manner and often fail to fully explore the loss surface, thus falling into a poor local optimum and suffering from low transferability. To solve this problem, we propose Attentive-Diversity Attack (ADA), which disrupts diverse salient features in a stochastic manner to improve transferability. Primarily, we perturb the image attention to disrupt universal features shared by different models. Then, to effectively avoid poor local optima, we disrupt these features in a stochastic manner and explore the search space of transferable perturbations more exhaustively. More specifically, we use a generator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wkim97/ada
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications