Predictive Red Teaming: Breaking Policies Without Breaking Robots
Anirudha Majumdar, Mohit Sharma, Dmitry Kalashnikov, Sumeet Singh,, Pierre Sermanet, Vikas Sindhwani

TL;DR
This paper introduces RoboART, an automated pipeline for predicting visuomotor policy vulnerabilities to environmental changes, enabling efficient identification of failure scenarios without hardware testing.
Contribution
The paper presents RoboART, a novel generative image editing and anomaly detection framework for predictive red teaming of visuomotor policies in off-nominal conditions.
Findings
RoboART predicts performance degradation with less than 0.19 average difference from real success rates.
Targeted data collection based on RoboART predictions significantly improves policy performance.
The approach reduces the need for costly hardware evaluations in vulnerability assessment.
Abstract
Visuomotor policies trained via imitation learning are capable of performing challenging manipulation tasks, but are often extremely brittle to lighting, visual distractors, and object locations. These vulnerabilities can depend unpredictably on the specifics of training, and are challenging to expose without time-consuming and expensive hardware evaluations. We propose the problem of predictive red teaming: discovering vulnerabilities of a policy with respect to environmental factors, and predicting the corresponding performance degradation without hardware evaluations in off-nominal scenarios. In order to achieve this, we develop RoboART: an automated red teaming (ART) pipeline that (1) modifies nominal observations using generative image editing to vary different environmental factors, and (2) predicts performance under each variation using a policy-specific anomaly detector executed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
MethodsDiffusion
