Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation
Zeyu Qin, Yanbo Fan, Yi Liu, Li Shen, Yong Zhang, Jue Wang, Baoyuan Wu

TL;DR
This paper introduces a novel adversarial attack method called reverse adversarial perturbation (RAP) that enhances the transferability of adversarial examples across different models by seeking more stable perturbations, significantly improving attack success rates.
Contribution
The paper proposes RAP, a new bi-level optimization-based attack method that reduces overfitting of surrogate models, thereby improving transferability of adversarial examples in black-box settings.
Findings
RAP significantly boosts transferability of adversarial attacks.
RAP achieves 22% improvement in targeted attack success on Google Cloud Vision API.
RAP can be combined with existing black-box attack techniques for further gains.
Abstract
Deep neural networks (DNNs) have been shown to be vulnerable to adversarial examples, which can produce erroneous predictions by injecting imperceptible perturbations. In this work, we study the transferability of adversarial examples, which is significant due to its threat to real-world applications where model architecture or parameters are usually unknown. Many existing works reveal that the adversarial examples are likely to overfit the surrogate model that they are generated from, limiting its transfer attack performance against different target models. To mitigate the overfitting of the surrogate model, we propose a novel attack method, dubbed reverse adversarial perturbation (RAP). Specifically, instead of minimizing the loss of a single adversarial point, we advocate seeking adversarial example located at a region with unified low loss value, by injecting the worst-case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)
