Boosting Adversarial Transferability via Residual Perturbation Attack
Jinjia Peng, Zeze Tao, Huibing Wang, Meng Wang, Yang Wang

TL;DR
This paper introduces Residual Perturbation Attack (ResPA), a novel method that improves adversarial transferability by guiding perturbations towards flat loss regions using residual gradients, outperforming existing methods.
Contribution
ResPA leverages residual gradients and exponential moving averages to enhance transferability of adversarial examples in black-box attacks, addressing limitations of prior flatness-based approaches.
Findings
ResPA achieves superior transferability compared to existing methods.
Combining ResPA with input transformations further improves attack success.
ResPA effectively guides adversarial examples toward flat loss regions.
Abstract
Deep neural networks are susceptible to adversarial examples, in which imperceptible perturbations cause incorrect predictions. Transfer-based attacks craft adversarial examples on surrogate models and transfer them to target models in black-box scenarios. Recent studies reveal that adversarial examples located in flat regions of the loss landscape transfer better, since flatness alleviates overfitting to the surrogate model. However, prior work overlooks the influence of perturbation directions, resulting in limited transferability. In this paper, we propose a novel attack method, named Residual Perturbation Attack (ResPA), which uses the residual gradient as the perturbation direction to guide adversarial examples toward flat regions of the loss function. Specifically, ResPA conducts an exponential moving average on the input gradients to obtain the first moment as the…
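The abstract is cut off before it says how the first moment is used, so the following is only a minimal sketch of the two ingredients the text does name: an exponential moving average (EMA) of input gradients and a residual taken relative to it. The residual definition (current gradient minus EMA), the sign-based step, the projection onto an L-infinity ball, the toy quadratic loss, and every function and parameter name here are assumptions for illustration, not the authors' algorithm.

```python
import numpy as np


def ema_update(m, g, beta=0.9):
    """One EMA step on gradients: m <- beta * m + (1 - beta) * g."""
    return beta * m + (1.0 - beta) * g


def residual_sign_attack(grad_fn, x0, eps=0.1, steps=10, beta=0.9):
    """Toy iterative attack that steps along the sign of an assumed residual.

    grad_fn: hypothetical callable returning the loss gradient w.r.t. the input.
    The residual (g - m) is an assumption; the source text is truncated
    before it defines how the first moment enters the update.
    """
    alpha = eps / steps              # per-step size
    x = x0.copy()
    m = np.zeros_like(x0)            # EMA (first moment) of input gradients
    for _ in range(steps):
        g = grad_fn(x)               # current input gradient
        m = ema_update(m, g, beta)   # update the first moment
        r = g - m                    # assumed residual gradient
        x = x + alpha * np.sign(r)   # ascend the loss along sign(residual)
        x = np.clip(x, x0 - eps, x0 + eps)  # stay inside the eps-ball
    return x


# Toy stand-in for a surrogate model: loss(x) = 0.5 * x^T A x, gradient = A x.
A = np.diag([1.0, 3.0])


def loss(x):
    return 0.5 * x @ A @ x


def grad_fn(x):
    return A @ x


x0 = np.array([1.0, -2.0])
x_adv = residual_sign_attack(grad_fn, x0, eps=0.1, steps=10, beta=0.9)
print("loss before:", loss(x0), "loss after:", loss(x_adv))
```

On a real surrogate network, `grad_fn` would be replaced by the gradient of the classification loss with respect to the input image, obtained through automatic differentiation.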
Taxonomy
Topics: Adversarial Robustness in Machine Learning
