Set-level Guidance Attack: Boosting Adversarial Transferability of   Vision-Language Pre-training Models

Dong Lu; Zhiqiang Wang; Teng Wang; Weili Guan; Hongchang Gao; Feng; Zheng

arXiv:2307.14061·cs.CV·July 27, 2023·2 cites

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models

Dong Lu, Zhiqiang Wang, Teng Wang, Weili Guan, Hongchang Gao, Feng, Zheng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel attack method called Set-level Guidance Attack (SGA) that significantly improves the transferability of adversarial examples across vision-language pre-training models by leveraging cross-modal interactions and alignment-preserving augmentations.

Contribution

The paper presents the first study on adversarial transferability in VLP models and proposes SGA, a highly transferable attack method that outperforms existing approaches in cross-model attacks.

Findings

01

SGA significantly increases transfer attack success rates.

02

Cross-modal interactions are crucial for transferability.

03

SGA outperforms state-of-the-art methods in experiments.

Abstract

Vision-language pre-training (VLP) models have shown vulnerability to adversarial examples in multimodal tasks. Furthermore, malicious adversaries can be deliberately transferred to attack other black-box models. However, existing work has mainly focused on investigating white-box attacks. In this paper, we present the first study to investigate the adversarial transferability of recent VLP models. We observe that existing methods exhibit much lower transferability, compared to the strong attack performance in white-box settings. The transferability degradation is partly caused by the under-utilization of cross-modal interactions. Particularly, unlike unimodal learning, VLP models rely heavily on cross-modal interactions and the multimodal alignments are many-to-many, e.g., an image can be described in various natural languages. To this end, we propose a highly transferable Set-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Zoky-2020/Set-level_Guidance_Attack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsALBEF