TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches

Zhengxian Huang; Wenjun Zhu; Haoxuan Qiu; Xiaoyu Ji; Wenyuan Xu

arXiv:2603.23117·cs.CR·March 25, 2026

TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches

Zhengxian Huang, Wenjun Zhu, Haoxuan Qiu, Xiaoyu Ji, Wenyuan Xu

PDF

Open Access

TL;DR

This paper reveals a vulnerability in vision-language-action models with Chain-of-Thought reasoning, demonstrating how adversarial patches can hijack robotic actions without altering user instructions, highlighting security concerns.

Contribution

The paper introduces TRAP, the first targeted adversarial attack framework that manipulates CoT reasoning in VLA models using physical patches, exposing security vulnerabilities.

Findings

01

TRAP effectively hijacks VLA model outputs across multiple architectures.

02

Physical patches printed on paper can successfully attack real-world systems.

03

CoT reasoning strongly influences action generation, even when misaligned with instructions.

Abstract

By integrating Chain-of-Thought(CoT) reasoning, Vision-Language-Action (VLA) models have demonstrated strong capabilities in robotic manipulation, particularly by improving generalization and interpretability. However, the security of CoT-based reasoning mechanisms remains largely unexplored. In this paper, we show that CoT reasoning introduces a novel attack vector for targeted control hijacking--for example, causing a robot to mistakenly deliver a knife to a person instead of an apple--without modifying the user's instruction. We first provide empirical evidence that CoT strongly governs action generation, even when it is semantically misaligned with the input instructions. Building on this observation, we propose TRAP, the first targeted adversarial attack framework for CoT-reasoning VLA models. TRAP uses an adversarial patch (e.g., a coaster placed on the table) to corrupt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI