CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
Hao Fang, Jiawei Kong, Bin Chen, Tao Dai, Hao Wu, Shu-Tao Xia

TL;DR
This paper introduces a CLIP-guided generative network with cross-attention modules to improve the transferability and efficiency of multi-target targeted adversarial attacks, leveraging semantic information from text.
Contribution
The paper proposes a novel CLIP-guided generative network with cross-attention modules (CGNC) that enhances multi-target adversarial attacks by incorporating textual knowledge, outperforming previous methods.
Findings
21.46% success rate improvement over previous methods
Effective in both multi-target and single-target attack scenarios
Demonstrates significant transferability and robustness
Abstract
Transferable targeted adversarial attacks aim to mislead models into outputting adversary-specified predictions in black-box scenarios. Recent studies have introduced \textit{single-target} generative attacks that train a generator for each target class to generate highly transferable perturbations, resulting in substantial computational overhead when handling multiple classes. \textit{Multi-target} attacks address this by training only one class-conditional generator for multiple classes. However, the generator simply uses class labels as conditions, failing to leverage the rich semantic information of the target class. To this end, we design a \textbf{C}LIP-guided \textbf{G}enerative \textbf{N}etwork with \textbf{C}ross-attention modules (CGNC) to enhance multi-target attacks by incorporating textual knowledge of CLIP into the generator. Extensive experiments demonstrate that CGNC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Network Security and Intrusion Detection · Radiation Effects in Electronics
MethodsContrastive Language-Image Pre-training
