Adaptive Perturbation for Adversarial Attack
Zheng Yuan, Jie Zhang, Zhaoyan Jiang, Liangliang Li, Shiguang Shan

TL;DR
This paper introduces an adaptive perturbation method that uses exact gradient directions instead of sign functions, improving the success and transferability of adversarial attacks on neural networks.
Contribution
It proposes a novel approach using exact gradient directions with adaptive scaling, enhancing attack success rates and transferability over existing methods.
Findings
Higher attack success rates on CIFAR10 and ImageNet.
Improved black-box transferability of adversarial examples.
Compatible with most gradient-based attack methods.
Abstract
In recent years, the security of deep learning models achieves more and more attentions with the rapid development of neural networks, which are vulnerable to adversarial examples. Almost all existing gradient-based attack methods use the sign function in the generation to meet the requirement of perturbation budget on norm. However, we find that the sign function may be improper for generating adversarial examples since it modifies the exact gradient direction. Instead of using the sign function, we propose to directly utilize the exact gradient direction with a scaling factor for generating adversarial perturbations, which improves the attack success rates of adversarial examples even with fewer perturbations. At the same time, we also theoretically prove that this method can achieve better black-box transferability. Moreover, considering that the best scaling factor varies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
