Boosting Adversarial Transferability via Ensemble Non-Attention
Yipeng Zou, Qin Liu, Jie Wu, Yu Peng, Guo Chen, Hui Zhou, Guanghui Ye

TL;DR
This paper introduces NAMEA, a novel ensemble attack method that leverages gradients from non-attention areas of models to improve adversarial transferability across heterogeneous architectures, especially CNNs and ViTs.
Contribution
It pioneers integrating non-attention area gradients into ensemble attacks and employs meta-learning to fuse transfer information, enhancing attack effectiveness across diverse models.
Findings
NAMEA outperforms state-of-the-art ensemble attacks by 15.0% and 9.6%.
Decoupling and merging attention and non-attention gradients improves transferability.
The approach provides new insights into ensemble attack strategies across different architectures.
Abstract
Ensemble attacks integrate the outputs of surrogate models with diverse architectures, which can be combined with various gradient-based attacks to improve adversarial transferability. However, previous work shows unsatisfactory attack performance when transferring across heterogeneous model architectures. The main reason is that the gradient update directions of heterogeneous surrogate models differ widely, making it hard to reduce the gradient variance of ensemble models while making the best of individual model. To tackle this challenge, we design a novel ensemble attack, NAMEA, which for the first time integrates the gradients from the non-attention areas of ensemble models into the iterative gradient optimization process. Our design is inspired by the observation that the attention areas of heterogeneous models vary sharply, thus the non-attention areas of ViTs are likely to be the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
