Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Qi Li; Bo Yin; Weiqi Huang; Ruhao Liu; Bojun Zou; Runpeng Yu; Jingwen Ye; Weihao Yu; Xinchao Wang

arXiv:2604.23775·cs.RO·April 28, 2026

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Qi Li, Bo Yin, Weiqi Huang, Ruhao Liu, Bojun Zou, Runpeng Yu, Jingwen Ye, Weihao Yu, Xinchao Wang

PDF

1 Repo

TL;DR

This survey reviews safety challenges, threats, defenses, and evaluation methods for Vision-Language-Action models, emphasizing their embodied nature and the need for unified safety frameworks across stages.

Contribution

It provides a comprehensive, organized overview of VLA safety issues, categorizing threats and defenses along attack and defense timing axes, and highlights open problems in the field.

Findings

01

Identifies key safety threats at training and inference stages.

02

Reviews existing defenses and evaluation metrics for VLA models.

03

Highlights open problems like certified robustness and safety-aware training.

Abstract

Vision-Language-Action (VLA) models are emerging as a unified substrate for embodied intelligence. This shift raises a new class of safety challenges, stemming from the embodied nature of VLA systems, including irreversible physical consequences, a multimodal attack surface across vision, language, and state, real-time latency constraints on defense, error propagation over long-horizon trajectories, and vulnerabilities in the data supply chain. Yet the literature remains fragmented across robotic learning, adversarial machine learning, AI alignment, and autonomous systems safety. This survey provides a unified and up-to-date overview of safety in Vision-Language-Action models. We organize the field along two parallel timing axes, attack timing (training-time vs. inference-time and defense timing (training-time vs. inference-time, linking each class of threat to the stage at which it can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liqiiiii/Awesome-VLA-Safety
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.