AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on   Vision-language Models

Jiaming Zhang; Junhong Ye; Xingjun Ma; Yige Li; Yunfan Yang; Yunhao; Chen; Jitao Sang; Dit-Yan Yeung

arXiv:2410.05346·cs.LG·March 31, 2025

AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models

Jiaming Zhang, Junhong Ye, Xingjun Ma, Yige Li, Yunfan Yang, Yunhao, Chen, Jitao Sang, Dit-Yan Yeung

PDF

Open Access 1 Models

TL;DR

AnyAttack introduces a self-supervised, large-scale adversarial attack framework that can target any output in vision-language models without specific labels, exposing systemic vulnerabilities across multiple systems.

Contribution

It presents a novel foundation model approach trained on unlabeled data, enabling flexible, scalable adversarial attacks on diverse vision-language models and commercial systems.

Findings

01

Effective across five open-source VLMs

02

Transfers successfully to commercial systems

03

Reveals systemic vulnerabilities in VLMs

Abstract

Due to their multimodal capabilities, Vision-Language Models (VLMs) have found numerous impactful applications in real-world scenarios. However, recent studies have revealed that VLMs are vulnerable to image-based adversarial attacks. Traditional targeted adversarial attacks require specific targets and labels, limiting their real-world impact.We present AnyAttack, a self-supervised framework that transcends the limitations of conventional attacks through a novel foundation model approach. By pre-training on the massive LAION-400M dataset without label supervision, AnyAttack achieves unprecedented flexibility - enabling any image to be transformed into an attack vector targeting any desired output across different VLMs.This approach fundamentally changes the threat landscape, making adversarial capabilities accessible at an unprecedented scale. Our extensive validation across five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
jiamingzz/anyattack
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Residual Connection · Linear Layer · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Weight Decay · Softmax · Attention Dropout