Improving Adversarial Transferability in MLLMs via Dynamic   Vision-Language Alignment Attack

Chenhe Gu; Jindong Gu; Andong Hua; Yao Qin

arXiv:2502.19672·cs.CV·February 28, 2025

Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack

Chenhe Gu, Jindong Gu, Andong Hua, Yao Qin

PDF

Open Access

TL;DR

This paper introduces DynVLA, a novel adversarial attack method that enhances transferability across diverse multimodal large language models by dynamically perturbing vision-language alignment components.

Contribution

The paper proposes DynVLA, a new dynamic perturbation approach targeting vision-language alignment to improve adversarial transferability in MLLMs.

Findings

01

DynVLA significantly improves attack transferability across multiple MLLMs.

02

The method is effective on both open-source and closed-source models.

03

Experimental results demonstrate enhanced robustness of adversarial examples.

Abstract

Multimodal Large Language Models (MLLMs), built upon LLMs, have recently gained attention for their capabilities in image recognition and understanding. However, while MLLMs are vulnerable to adversarial attacks, the transferability of these attacks across different models remains limited, especially under targeted attack setting. Existing methods primarily focus on vision-specific perturbations but struggle with the complex nature of vision-language modality alignment. In this work, we introduce the Dynamic Vision-Language Alignment (DynVLA) Attack, a novel approach that injects dynamic perturbations into the vision-language connector to enhance generalization across diverse vision-language alignment of different models. Our experimental results show that DynVLA significantly improves the transferability of adversarial examples across various MLLMs, including BLIP2, InstructBLIP,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Topic Modeling

MethodsSoftmax · Attention Is All You Need · Focus