Embodied Red Teaming for Auditing Robotic Foundation Models
Sathwik Karnik, Zhang-Wei Hong, Nishant Abhangi, Yen-Chen Lin,, Tsun-Hsuan Wang, Christophe Dupuy, Rahul Gupta, Pulkit Agrawal

TL;DR
This paper introduces Embodied Red Teaming (ERT), a novel evaluation method using automated techniques to generate challenging instructions for testing the safety and robustness of language-conditioned robotic models.
Contribution
The paper presents ERT, a new automated red teaming approach that creates diverse, challenging instructions grounded in context to evaluate robotic models beyond traditional benchmarks.
Findings
State-of-the-art models often fail on ERT instructions.
Models exhibit unsafe behaviors on challenging instructions.
Current benchmarks do not adequately assess safety or robustness.
Abstract
Language-conditioned robot models have the potential to enable robots to perform a wide range of tasks based on natural language instructions. However, assessing their safety and effectiveness remains challenging because it is difficult to test all the different ways a single task can be phrased. Current benchmarks have two key limitations: they rely on a limited set of human-generated instructions, missing many challenging cases, and focus only on task performance without assessing safety, such as avoiding damage. To address these gaps, we introduce Embodied Red Teaming (ERT), a new evaluation method that generates diverse and challenging instructions to test these models. ERT uses automated red teaming techniques with Vision Language Models (VLMs) to create contextually grounded, difficult instructions. Experimental results show that state-of-the-art language-conditioned robot models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBIM and Construction Integration · 3D Modeling in Geospatial Applications
MethodsSparse Evolutionary Training · Focus
