Predictive Red Teaming: Breaking Policies Without Breaking Robots

Anirudha Majumdar; Mohit Sharma; Dmitry Kalashnikov; Sumeet Singh,; Pierre Sermanet; Vikas Sindhwani

arXiv:2502.06575·cs.RO·February 11, 2025

Predictive Red Teaming: Breaking Policies Without Breaking Robots

Anirudha Majumdar, Mohit Sharma, Dmitry Kalashnikov, Sumeet Singh,, Pierre Sermanet, Vikas Sindhwani

PDF

Open Access

TL;DR

This paper introduces RoboART, an automated pipeline for predicting visuomotor policy vulnerabilities to environmental changes, enabling efficient identification of failure scenarios without hardware testing.

Contribution

The paper presents RoboART, a novel generative image editing and anomaly detection framework for predictive red teaming of visuomotor policies in off-nominal conditions.

Findings

01

RoboART predicts performance degradation with less than 0.19 average difference from real success rates.

02

Targeted data collection based on RoboART predictions significantly improves policy performance.

03

The approach reduces the need for costly hardware evaluations in vulnerability assessment.

Abstract

Visuomotor policies trained via imitation learning are capable of performing challenging manipulation tasks, but are often extremely brittle to lighting, visual distractors, and object locations. These vulnerabilities can depend unpredictably on the specifics of training, and are challenging to expose without time-consuming and expensive hardware evaluations. We propose the problem of predictive red teaming: discovering vulnerabilities of a policy with respect to environmental factors, and predicting the corresponding performance degradation without hardware evaluations in off-nominal scenarios. In order to achieve this, we develop RoboART: an automated red teaming (ART) pipeline that (1) modifies nominal observations using generative image editing to vary different environmental factors, and (2) predicts performance under each variation using a policy-specific anomaly detector executed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI

MethodsDiffusion