Evaluating the Robustness of Multimodal Agents Against Active Environmental Injection Attacks
Yurun Chen, Xavier Hu, Keting Yin, Juncheng Li, Shengyu Zhang

TL;DR
This paper introduces Active Environment Injection Attacks (AEIA), a new security threat to multimodal AI agents in mobile OS environments, demonstrating their high vulnerability with a 93% success rate.
Contribution
The paper identifies a novel security threat, AEIA, analyzes vulnerabilities in Android OS-based agents, and proposes an attack scheme to evaluate their robustness.
Findings
MLLM-based agents are highly vulnerable to AEIA.
Maximum attack success rate of 93% on AndroidWorld benchmark.
Two critical vulnerabilities: adversarial content injection and reasoning gaps.
Abstract
As researchers continue to optimize AI agents for more effective task execution within operating systems, they often overlook a critical security concern: the ability of these agents to detect "impostors" within their environment. Through an analysis of the agents' operational context, we identify a significant threat-attackers can disguise malicious attacks as environmental elements, injecting active disturbances into the agents' execution processes to manipulate their decision-making. We define this novel threat as the Active Environment Injection Attack (AEIA). Focusing on the interaction mechanisms of the Android OS, we conduct a risk assessment of AEIA and identify two critical security vulnerabilities: (1) Adversarial content injection in multimodal interaction interfaces, where attackers embed adversarial instructions within environmental elements to mislead agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Mobile Agent-Based Network Management · Advanced Malware Detection Techniques
