Asking for Help: Failure Prediction in Behavioral Cloning through Value   Approximation

Cem Gokmen; Daniel Ho; Mohi Khansari

arXiv:2302.04334·cs.RO·February 10, 2023

Asking for Help: Failure Prediction in Behavioral Cloning through Value Approximation

Cem Gokmen, Daniel Ho, Mohi Khansari

PDF

Open Access

TL;DR

This paper introduces BCVA, a method for predicting failures in behavioral cloning policies by learning a state value function, enabling robots to request help and improve autonomous operation in real-world tasks.

Contribution

The paper presents BCVA, a novel approach to predict failures in behavioral cloning by jointly learning a value function, enhancing failure detection in robotic manipulation tasks.

Findings

01

Achieved 86% precision and 81% recall in failure prediction.

02

Improved failure detection accuracy by 10 percentage points over baseline.

03

Validated on over 2000 real-world robot runs.

Abstract

Recent progress in end-to-end Imitation Learning approaches has shown promising results and generalization capabilities on mobile manipulation tasks. Such models are seeing increasing deployment in real-world settings, where scaling up requires robots to be able to operate with high autonomy, i.e. requiring as little human supervision as possible. In order to avoid the need for one-on-one human supervision, robots need to be able to detect and prevent policy failures ahead of time, and ask for help, allowing a remote operator to supervise multiple robots and help when needed. However, the black-box nature of end-to-end Imitation Learning models such as Behavioral Cloning, as well as the lack of an explicit state-value representation, make it difficult to predict failures. To this end, we introduce Behavioral Cloning Value Approximation (BCVA), an approach to learning a state value…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Algorithms