Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems
M Zeeshan, Saud Satti

TL;DR
Chameleon introduces an adaptive adversarial framework that exploits scaling vulnerabilities in Vision-Language Models, revealing significant security risks in multimodal AI systems and highlighting the need for robust defenses.
Contribution
This paper presents Chameleon, a novel, iterative, agent-based attack method that dynamically crafts robust adversarial examples to exploit scaling vulnerabilities in production VLMs.
Findings
Achieves 84.5% attack success rate against Gemini 2.5 Flash model.
Reduces downstream decision accuracy by over 45%.
Outperforms static baseline attacks significantly.
Abstract
Multimodal Artificial Intelligence (AI) systems, particularly Vision-Language Models (VLMs), have become integral to critical applications ranging from autonomous decision-making to automated document processing. As these systems scale, they rely heavily on preprocessing pipelines to handle diverse inputs efficiently. However, this dependency on standard preprocessing operations, specifically image downscaling, creates a significant yet often overlooked security vulnerability. While intended for computational optimization, scaling algorithms can be exploited to conceal malicious visual prompts that are invisible to human observers but become active semantic instructions once processed by the model. Current adversarial strategies remain largely static, failing to account for the dynamic nature of modern agentic workflows. To address this gap, we propose Chameleon, a novel, adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Security and Verification in Computing
