Asking for Help: Failure Prediction in Behavioral Cloning through Value Approximation
Cem Gokmen, Daniel Ho, Mohi Khansari

TL;DR
This paper introduces BCVA, a method for predicting failures in behavioral cloning policies by learning a state value function, enabling robots to request help and improve autonomous operation in real-world tasks.
Contribution
The paper presents BCVA, a novel approach to predict failures in behavioral cloning by jointly learning a value function, enhancing failure detection in robotic manipulation tasks.
Findings
Achieved 86% precision and 81% recall in failure prediction.
Improved failure detection accuracy by 10 percentage points over baseline.
Validated on over 2000 real-world robot runs.
Abstract
Recent progress in end-to-end Imitation Learning approaches has shown promising results and generalization capabilities on mobile manipulation tasks. Such models are seeing increasing deployment in real-world settings, where scaling up requires robots to be able to operate with high autonomy, i.e. requiring as little human supervision as possible. In order to avoid the need for one-on-one human supervision, robots need to be able to detect and prevent policy failures ahead of time, and ask for help, allowing a remote operator to supervise multiple robots and help when needed. However, the black-box nature of end-to-end Imitation Learning models such as Behavioral Cloning, as well as the lack of an explicit state-value representation, make it difficult to predict failures. To this end, we introduce Behavioral Cloning Value Approximation (BCVA), an approach to learning a state value…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Algorithms
