Chain of Attack: On the Robustness of Vision-Language Models Against   Transfer-Based Adversarial Attacks

Peng Xie; Yequan Bie; Jianda Mao; Yangqiu Song; Yang Wang; Hao Chen,; Kani Chen

arXiv:2411.15720·cs.CV·November 26, 2024

Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks

Peng Xie, Yequan Bie, Jianda Mao, Yangqiu Song, Yang Wang, Hao Chen,, Kani Chen

PDF

Open Access

TL;DR

This paper introduces Chain of Attack (CoA), a novel method for improving transfer-based adversarial attacks on vision-language models by leveraging multi-modal semantic updates, revealing significant vulnerabilities in these models.

Contribution

We propose CoA, an iterative adversarial attack method that enhances transferability by considering semantic correlations between vision and language modalities.

Findings

01

CoA achieves higher attack success rates than existing methods.

02

The method effectively misleads models to generate targeted responses.

03

Robustness evaluation reveals significant vulnerabilities in current VLMs.

Abstract

Pre-trained vision-language models (VLMs) have showcased remarkable performance in image and natural language understanding, such as image captioning and response generation. As the practical applications of vision-language models become increasingly widespread, their potential safety and robustness issues raise concerns that adversaries may evade the system and cause these models to generate toxic content through malicious attacks. Therefore, evaluating the robustness of open-source VLMs against adversarial attacks has garnered growing attention, with transfer-based attacks as a representative black-box attacking strategy. However, most existing transfer-based attacks neglect the importance of the semantic correlations between vision and text modalities, leading to sub-optimal adversarial example generation and attack performance. To address this issue, we present Chain of Attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research