One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP

Binyan Xu; Xilin Dai; Di Tang; and Kehuan Zhang

arXiv:2505.19840·cs.CR·July 9, 2025

One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP

Binyan Xu, Xilin Dai, Di Tang, and Kehuan Zhang

PDF

Open Access 1 Repo 2 Models

TL;DR

This paper introduces UnivIntruder, a novel adversarial attack method that uses a single CLIP model and textual concepts to generate universal, transferable, and targeted adversarial examples, exposing vulnerabilities in various AI systems without needing target model queries.

Contribution

The paper presents UnivIntruder, a new attack framework that leverages CLIP and textual concepts to craft effective adversarial examples without access to target models or training data.

Findings

01

Achieves up to 85% attack success rate on ImageNet

02

Successfully compromises image search engines and language models

03

Outperforms existing transfer-based attack methods

Abstract

Deep Neural Networks (DNNs) have achieved widespread success yet remain prone to adversarial attacks. Typically, such attacks either involve frequent queries to the target model or rely on surrogate models closely mirroring the target model -- often trained with subsets of the target model's training data -- to achieve high attack success rates through transferability. However, in realistic scenarios where training data is inaccessible and excessive queries can raise alarms, crafting adversarial examples becomes more challenging. In this paper, we present UnivIntruder, a novel attack framework that relies solely on a single, publicly available CLIP model and publicly available datasets. By using textual concepts, UnivIntruder generates universal, transferable, and targeted adversarial perturbations that mislead DNNs into misclassifying inputs into adversary-specified classes defined by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

binyxu/univintruder
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Multi-Head Attention · Layer Normalization · Byte Pair Encoding