Altered Thoughts, Altered Actions: Probing Chain-of-Thought Vulnerabilities in VLA Robotic Manipulation

Tuan Duong Trinh; Naveed Akhtar; Basim Azam

arXiv:2603.12717·cs.RO·March 16, 2026

Altered Thoughts, Altered Actions: Probing Chain-of-Thought Vulnerabilities in VLA Robotic Manipulation

Tuan Duong Trinh, Naveed Akhtar, Basim Azam

PDF

Open Access

TL;DR

This paper investigates vulnerabilities in chain-of-thought reasoning in Vision-Language-Action models for robotic manipulation, revealing that corrupting object references significantly impairs task success, unlike other text perturbations.

Contribution

It introduces a taxonomy of text corruptions and demonstrates that the action decoder relies heavily on entity references rather than reasoning quality, exposing a new stealthy attack vector.

Findings

01

Object name substitution reduces success rate by 8.3pp

02

Sentence reordering and token noise have negligible impact

03

LLM-based attacks are less effective than simple object-name swaps

Abstract

Recent Vision-Language-Action (VLA) models increasingly adopt chain-of-thought (CoT) reasoning, generating a natural-language plan before decoding motor commands. This internal text channel between the reasoning module and the action decoder has received no adversarial scrutiny. We ask: which properties of this intermediate plan does the action decoder actually rely on, and can targeted corruption of the reasoning trace alone -- with all inputs left intact -- degrade a robot's physical task performance? We design a taxonomy of seven text corruptions organized into three attacker tiers (blind noise, mechanical-semantic, and LLM-adaptive) and apply them to a state-of-the-art reasoning VLA across 40 LIBERO tabletop manipulation tasks. Our results reveal a striking asymmetry: substituting object names in the reasoning trace reduces overall success rate by 8.3~percentage points (pp) --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Robot Manipulation and Learning