One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Jiale Zhao; Xinyang Jiang; Junyao Gao; Yuhao Xue; Cairong Zhao

arXiv:2507.07709·cs.CV·July 11, 2025

One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Jiale Zhao, Xinyang Jiang, Junyao Gao, Yuhao Xue, Cairong Zhao

PDF

Open Access

TL;DR

This paper introduces CrossVLAD, a benchmark dataset and a novel attack framework to evaluate and improve cross-task adversarial attacks on unified vision-language models, revealing their vulnerabilities across multiple tasks.

Contribution

It presents a new benchmark dataset with GPT-4 annotations and a region-based attack method for assessing adversarial robustness of unified VLMs across tasks.

Findings

01

CRAFT outperforms existing attack methods in success rate

02

Unified VLMs are vulnerable to cross-task adversarial attacks

03

CrossVLAD provides a systematic evaluation framework for adversarial robustness

Abstract

Unified vision-language models(VLMs) have recently shown remarkable progress, enabling a single model to flexibly address diverse tasks through different instructions within a shared computational architecture. This instruction-based control mechanism creates unique security challenges, as adversarial inputs must remain effective across multiple task instructions that may be unpredictably applied to process the same malicious content. In this paper, we introduce CrossVLAD, a new benchmark dataset carefully curated from MSCOCO with GPT-4-assisted annotations for systematically evaluating cross-task adversarial attacks on unified VLMs. CrossVLAD centers on the object-change objective-consistently manipulating a target object's classification across four downstream tasks-and proposes a novel success rate metric that measures simultaneous misclassification across all tasks, providing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI