Altered Thoughts, Altered Actions: Probing Chain-of-Thought Vulnerabilities in VLA Robotic Manipulation
Tuan Duong Trinh, Naveed Akhtar, Basim Azam

TL;DR
This paper investigates vulnerabilities in chain-of-thought reasoning in Vision-Language-Action models for robotic manipulation, revealing that corrupting object references significantly impairs task success, unlike other text perturbations.
Contribution
It introduces a taxonomy of text corruptions and demonstrates that the action decoder relies heavily on entity references rather than reasoning quality, exposing a new stealthy attack vector.
Findings
Object name substitution reduces success rate by 8.3pp
Sentence reordering and token noise have negligible impact
LLM-based attacks are less effective than simple object-name swaps
Abstract
Recent Vision-Language-Action (VLA) models increasingly adopt chain-of-thought (CoT) reasoning, generating a natural-language plan before decoding motor commands. This internal text channel between the reasoning module and the action decoder has received no adversarial scrutiny. We ask: which properties of this intermediate plan does the action decoder actually rely on, and can targeted corruption of the reasoning trace alone -- with all inputs left intact -- degrade a robot's physical task performance? We design a taxonomy of seven text corruptions organized into three attacker tiers (blind noise, mechanical-semantic, and LLM-adaptive) and apply them to a state-of-the-art reasoning VLA across 40 LIBERO tabletop manipulation tasks. Our results reveal a striking asymmetry: substituting object names in the reasoning trace reduces overall success rate by 8.3~percentage points (pp) --…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Robot Manipulation and Learning
