On the Vulnerability of LLM/VLM-Controlled Robotics

Xiyang Wu; Souradip Chakraborty; Ruiqi Xian; Jing Liang; Tianrui Guan; Fuxiao Liu; Brian M. Sadler; Dinesh Manocha; Amrit Singh Bedi

arXiv:2402.10340 · cs.RO · March 10, 2025 · 2 citations

TL;DR

This paper investigates the vulnerabilities of LLM/VLM-controlled robots to input variations, revealing significant failure rates and emphasizing the need for robustness to ensure safe deployment.

Contribution

The paper presents a mathematical formulation of failure modes arising from variations in input modalities, together with empirical perturbation strategies that expose these vulnerabilities in LLM/VLM-controlled robotic systems.

Findings

01 Input perturbations reduce task success rates by over 20%.

02 The vulnerabilities are consistent across multiple manipulation tasks.

03 The results highlight the critical need for input robustness in robotic systems.

Abstract

In this work, we highlight vulnerabilities in robotic systems integrating large language models (LLMs) and vision-language models (VLMs) due to input modality sensitivities. While LLM/VLM-controlled robots show impressive performance across various tasks, their reliability under slight input variations remains underexplored yet critical. These models are highly sensitive to instruction or perceptual input changes, which can trigger misalignment issues, leading to execution failures with severe real-world consequences. To study this issue, we analyze the misalignment-induced vulnerabilities within LLM/VLM-controlled robotic systems and present a mathematical formulation for failure modes arising from variations in input modalities. We propose empirical perturbation strategies to expose these vulnerabilities and validate their effectiveness through experiments on multiple robot…

Code & Models

Repositories

gt-ripl/awesome-llm-robotics · pytorch · Official

Taxonomy

Topics: Modular Robots and Swarm Intelligence

Methods: Focus