Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting

Xiaohan Zhao; Zhaoyi Li; Yaxin Luo; Jiacheng Cui; Zhiqiang Shen

arXiv:2602.17645·cs.LG·February 20, 2026

Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting

Xiaohan Zhao, Zhaoyi Li, Yaxin Luo, Jiacheng Cui, Zhiqiang Shen

PDF

Open Access 1 Datasets

TL;DR

This paper introduces M-Attack-V2, an improved black-box attack method on LVLMs that uses multi-crop and auxiliary target alignment to significantly increase attack success rates.

Contribution

The paper proposes a modular enhancement over prior transfer-based attacks, incorporating multi-crop gradient averaging and auxiliary target alignment to improve transferability and stability.

Findings

01

Boosts attack success rates on LVLMs significantly.

02

Outperforms prior black-box attack methods.

03

Achieves near-perfect success on GPT-5.

Abstract

Black-box adversarial attacks on Large Vision-Language Models (LVLMs) are challenging due to missing gradients and complex multimodal boundaries. While prior state-of-the-art transfer-based approaches like M-Attack perform well using local crop-level matching between source and target images, we find this induces high-variance, nearly orthogonal gradients across iterations, violating coherent local alignment and destabilizing optimization. We attribute this to (i) ViT translation sensitivity that yields spike-like gradients and (ii) structural asymmetry between source and target crops. We reformulate local matching as an asymmetric expectation over source transformations and target semantics, and build a gradient-denoising upgrade to M-Attack. On the source side, Multi-Crop Alignment (MCA) averages gradients from multiple independently sampled local views per iteration to reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

MBZUAI-LLM/M-Attack-V2-Adversarial-Samples
dataset· 206 dl
206 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · COVID-19 diagnosis using AI