Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning
George Jiayuan Gao, Tianyu Li, Nadia Figueroa

TL;DR
This paper introduces an object-centric recovery framework that enhances visuomotor imitation learning by guiding policies back to training distribution in out-of-distribution scenarios, improving robustness without extra data collection.
Contribution
It presents a novel object-centric recovery policy based on inverse policy inference from object keypoints, which can be added to existing policies to handle OOD situations.
Findings
77.7% improvement over base policy in OOD scenarios
Effective in both simulation and real robot experiments
Enables autonomous demonstration collection for continual learning
Abstract
We propose an object-centric recovery (OCR) framework to address the challenges of out-of-distribution (OOD) scenarios in visuomotor policy learning. Previous behavior cloning (BC) methods rely heavily on a large amount of labeled data coverage, failing in unfamiliar spatial states. Without relying on extra data collection, our approach learns a recovery policy constructed by an inverse policy inferred from the object keypoint manifold gradient in the original training data. The recovery policy serves as a simple add-on to any base visuomotor BC policy, agnostic to a specific method, guiding the system back towards the training distribution to ensure task success even in OOD situations. We demonstrate the effectiveness of our object-centric framework in both simulation and real robot experiments, achieving an improvement of 77.7\% over the base policy in OOD. Furthermore, we show OCR's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robot Manipulation and Learning · Human Pose and Action Recognition
MethodsBalanced Selection
