TL;DR
This survey reviews safety challenges, threats, defenses, and evaluation methods for Vision-Language-Action models, emphasizing their embodied nature and the need for unified safety frameworks across stages.
Contribution
It provides a comprehensive, organized overview of VLA safety issues, categorizing threats and defenses along attack and defense timing axes, and highlights open problems in the field.
Findings
Identifies key safety threats at training and inference stages.
Reviews existing defenses and evaluation metrics for VLA models.
Highlights open problems like certified robustness and safety-aware training.
Abstract
Vision-Language-Action (VLA) models are emerging as a unified substrate for embodied intelligence. This shift raises a new class of safety challenges, stemming from the embodied nature of VLA systems, including irreversible physical consequences, a multimodal attack surface across vision, language, and state, real-time latency constraints on defense, error propagation over long-horizon trajectories, and vulnerabilities in the data supply chain. Yet the literature remains fragmented across robotic learning, adversarial machine learning, AI alignment, and autonomous systems safety. This survey provides a unified and up-to-date overview of safety in Vision-Language-Action models. We organize the field along two parallel timing axes, attack timing (training-time vs. inference-time and defense timing (training-time vs. inference-time, linking each class of threat to the stage at which it can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
