Adversarial Attacks on Robotic Vision Language Action Models

Eliot Krzysztof Jones; Alexander Robey; Andy Zou; Zachary Ravichandran; George J. Pappas; Hamed Hassani; Matt Fredrikson; and J. Zico Kolter

arXiv:2506.03350·cs.RO·June 5, 2025

Adversarial Attacks on Robotic Vision Language Action Models

Eliot Krzysztof Jones, Alexander Robey, Andy Zou, Zachary Ravichandran, George J. Pappas, Hamed Hassani, Matt Fredrikson, and J. Zico Kolter

PDF

Open Access 1 Repo

TL;DR

This paper investigates the vulnerability of vision-language-action models (VLAs) used in robotics to adversarial attacks, demonstrating that textual jailbreak attacks can fully control these models and persist over time, raising safety concerns.

Contribution

The study adapts LLM jailbreaking techniques to robotic VLAs, revealing their susceptibility to adversarial control and highlighting the need for robustness in robotic systems.

Findings

01

Textual attacks enable full control over VLAs.

02

Attacks often persist over longer operational horizons.

03

Vulnerabilities pose safety risks in robotic applications.

Abstract

The emergence of vision-language-action models (VLAs) for end-to-end control is reshaping the field of robotics by enabling the fusion of multimodal sensory inputs at the billion-parameter scale. The capabilities of VLAs stem primarily from their architectures, which are often based on frontier large language models (LLMs). However, LLMs are known to be susceptible to adversarial misuse, and given the significant physical risks inherent to robotics, questions remain regarding the extent to which VLAs inherit these vulnerabilities. Motivated by these concerns, in this work we initiate the study of adversarial attacks on VLA-controlled robots. Our main algorithmic contribution is the adaptation and application of LLM jailbreaking attacks to obtain complete control authority over VLAs. We find that textual attacks, which are applied once at the beginning of a rollout, facilitate full…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eliotjones1/robogcg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI