Poison Once, Control Anywhere: Clean-Text Visual Backdoors in VLM-based Mobile Agents

Xuan Wang; Siyuan Liang; Zhe Liu; Yi Yu; Aishan Liu; Yuliang Lu; Xitong Gao; Ee-Chien Chang

arXiv:2506.13205·cs.CR·September 8, 2025

Poison Once, Control Anywhere: Clean-Text Visual Backdoors in VLM-based Mobile Agents

Xuan Wang, Siyuan Liang, Zhe Liu, Yi Yu, Aishan Liu, Yuliang Lu, Xitong Gao, Ee-Chien Chang

PDF

TL;DR

This paper introduces VIBMA, a novel clean-text backdoor attack on VLM-based mobile agents that manipulates visual inputs to embed stealthy malicious behaviors without altering textual prompts, exposing security vulnerabilities.

Contribution

It presents the first attack targeting VLM-based mobile agents using visual triggers, demonstrating high success rates and stealthiness, and highlights security risks in mobile agent fine-tuning.

Findings

01

High attack success rate (up to 94.67%)

02

Maintains task performance with minimal impact (up to 95.85% FSR)

03

Effective triggers include static, dynamic, and blended patterns

Abstract

Mobile agents powered by vision-language models (VLMs) are increasingly adopted for tasks such as UI automation and camera-based assistance. These agents are typically fine-tuned using small-scale, user-collected data, making them susceptible to stealthy training-time threats. This work introduces VIBMA, the first clean-text backdoor attack targeting VLM-based mobile agents. The attack injects malicious behaviors into the model by modifying only the visual input while preserving textual prompts and instructions, achieving stealth through the complete absence of textual anomalies. Once the agent is fine-tuned on this poisoned data, adding a predefined visual pattern (trigger) at inference time activates the attacker-specified behavior (backdoor). Our attack aligns the training gradients of poisoned samples with those of an attacker-specified target instance, effectively embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.