Break the Visual Perception: Adversarial Attacks Targeting Encoded   Visual Tokens of Large Vision-Language Models

Yubo Wang; Chaohu Liu; Yanqiu Qu; Haoyu Cao; Deqiang Jiang; Linli Xu

arXiv:2410.06699·cs.CV·October 10, 2024

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models

Yubo Wang, Chaohu Liu, Yanqiu Qu, Haoyu Cao, Deqiang Jiang, Linli Xu

PDF

Open Access

TL;DR

This paper introduces VT-Attack, a novel adversarial attack targeting the visual tokens of large vision-language models, revealing vulnerabilities in their visual feature representations and highlighting robustness challenges.

Contribution

The paper proposes a new non-targeted attack method, VT-Attack, that effectively disrupts visual tokens in LVLMs using only image encoder access, demonstrating transferability and generality.

Findings

01

VT-Attack outperforms baseline methods in attack success.

02

Adversarial examples transfer across models with the same encoder.

03

The attack exposes robustness issues in LVLMs' visual feature space.

Abstract

Large vision-language models (LVLMs) integrate visual information into large language models, showcasing remarkable multi-modal conversational capabilities. However, the visual modules introduces new challenges in terms of robustness for LVLMs, as attackers can craft adversarial images that are visually clean but may mislead the model to generate incorrect answers. In general, LVLMs rely on vision encoders to transform images into visual tokens, which are crucial for the language models to perceive image contents effectively. Therefore, we are curious about one question: Can LVLMs still generate correct responses when the encoded visual tokens are attacked and disrupting the visual information? To this end, we propose a non-targeted attack method referred to as VT-Attack (Visual Tokens Attack), which constructs adversarial examples from multiple perspectives, with the goal of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning