PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention
Hefei Mei, Zirui Wang, Chang Xu, Jianyuan Guo, Minjing Dong

TL;DR
PA-Attack introduces a novel gray-box attack method on LVLMs that uses prototypes and attention mechanisms to improve attack transferability, efficiency, and task generalization across diverse models and tasks.
Contribution
It proposes a prototype-anchored guidance and a two-stage attention mechanism to enhance adversarial attacks on LVLMs, addressing limitations of prior white-box and black-box methods.
Findings
Achieves an average 75.1% score reduction rate across tasks.
Demonstrates strong attack effectiveness and efficiency.
Shows improved task generalization across LVLM architectures.
Abstract
Large Vision-Language Models (LVLMs) are foundational to modern multimodal applications, yet their susceptibility to adversarial attacks remains a critical concern. Prior white-box attacks rarely generalize across tasks, and black-box methods depend on expensive transfer, which limits efficiency. The vision encoder, standardized and often shared across LVLMs, provides a stable gray-box pivot with strong cross-model transfer. Building on this premise, we introduce PA-Attack (Prototype-Anchored Attentive Attack). PA-Attack begins with a prototype-anchored guidance that provides a stable attack direction towards a general and dissimilar prototype, tackling the attribute-restricted issue and limited task generalization of vanilla attacks. Building on this, we propose a two-stage attention enhancement mechanism: (i) leverage token-level attention scores to concentrate perturbations on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
