X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

Hanxun Huang; Sarah Erfani; Yige Li; Xingjun Ma; James Bailey

arXiv:2505.05528·cs.CV·June 2, 2025

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

Hanxun Huang, Sarah Erfani, Yige Li, Xingjun Ma, James Bailey

PDF

Open Access 1 Repo 10 Models 1 Video

TL;DR

This paper introduces X-Transfer, a novel attack method that creates a universal adversarial perturbation capable of deceiving various CLIP models and vision-language systems across multiple domains and tasks, revealing a super transferability vulnerability.

Contribution

X-Transfer presents a new scalable surrogate scaling technique to generate super transferable universal adversarial perturbations for CLIP models, outperforming previous methods.

Findings

01

X-Transfer achieves superior transferability across models and tasks.

02

The method significantly outperforms existing UAP techniques.

03

It establishes a new benchmark for adversarial attacks on CLIP.

Abstract

As Contrastive Language-Image Pre-training (CLIP) models are increasingly adopted for diverse downstream tasks and integrated into large vision-language models (VLMs), their susceptibility to adversarial perturbations has emerged as a critical concern. In this work, we introduce \textbf{X-Transfer}, a novel attack method that exposes a universal adversarial vulnerability in CLIP. X-Transfer generates a Universal Adversarial Perturbation (UAP) capable of deceiving various CLIP encoders and downstream VLMs across different samples, tasks, and domains. We refer to this property as \textbf{super transferability}--a single perturbation achieving cross-data, cross-domain, cross-model, and cross-task adversarial transferability simultaneously. This is achieved through \textbf{surrogate scaling}, a key innovation of our approach. Unlike existing methods that rely on fixed surrogate models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HanxunH/XTransferBench
pytorchOfficial

Models

Videos

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Domain Adaptation and Few-Shot Learning

MethodsContrastive Language-Image Pre-training