Making Adversarial Examples More Transferable and Indistinguishable
Junhua Zou, Yexin Duan, Boyu Li, Wu Zhang, Yu Pan, Zhisong Pan

TL;DR
This paper introduces AI-FGTM, a new method for generating adversarial examples that are both highly transferable and indistinguishable, improving attack success rates and reducing perturbation.
Contribution
The paper proposes AI-FGTM, a novel adversarial attack method that enhances transferability and indistinguishability without extra computational cost.
Findings
Achieves an average attack success rate of 89.3% on six defense models.
Reduces mean perturbation by nearly 20%.
Outperforms state-of-the-art gradient-based attacks.
Abstract
Fast gradient sign attack series are popular methods that are used to generate adversarial examples. However, most of the approaches based on fast gradient sign attack series cannot balance the indistinguishability and transferability due to the limitations of the basic sign structure. To address this problem, we propose a method, called Adam Iterative Fast Gradient Tanh Method (AI-FGTM), to generate indistinguishable adversarial examples with high transferability. Besides, smaller kernels and dynamic step size are also applied to generate adversarial examples for further increasing the attack success rates. Extensive experiments on an ImageNet-compatible dataset show that our method generates more indistinguishable adversarial examples and achieves higher attack success rates without extra running time and resource. Our best transfer-based attack NI-TI-DI-AITM can fool six classic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsAdam
