PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention

Hefei Mei; Zirui Wang; Chang Xu; Jianyuan Guo; Minjing Dong

arXiv:2602.19418·cs.CV·February 24, 2026

PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention

Hefei Mei, Zirui Wang, Chang Xu, Jianyuan Guo, Minjing Dong

PDF

Open Access

TL;DR

PA-Attack introduces a novel gray-box attack method on LVLMs that uses prototypes and attention mechanisms to improve attack transferability, efficiency, and task generalization across diverse models and tasks.

Contribution

It proposes a prototype-anchored guidance and a two-stage attention mechanism to enhance adversarial attacks on LVLMs, addressing limitations of prior white-box and black-box methods.

Findings

01

Achieves an average 75.1% score reduction rate across tasks.

02

Demonstrates strong attack effectiveness and efficiency.

03

Shows improved task generalization across LVLM architectures.

Abstract

Large Vision-Language Models (LVLMs) are foundational to modern multimodal applications, yet their susceptibility to adversarial attacks remains a critical concern. Prior white-box attacks rarely generalize across tasks, and black-box methods depend on expensive transfer, which limits efficiency. The vision encoder, standardized and often shared across LVLMs, provides a stable gray-box pivot with strong cross-model transfer. Building on this premise, we introduce PA-Attack (Prototype-Anchored Attentive Attack). PA-Attack begins with a prototype-anchored guidance that provides a stable attack direction towards a general and dissimilar prototype, tackling the attribute-restricted issue and limited task generalization of vanilla attacks. Building on this, we propose a two-stage attention enhancement mechanism: (i) leverage token-level attention scores to concentrate perturbations on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications