Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning

George Jiayuan Gao; Tianyu Li; Nadia Figueroa

arXiv:2411.03294·cs.RO·July 18, 2025

Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning

George Jiayuan Gao, Tianyu Li, Nadia Figueroa

PDF

Open Access

TL;DR

This paper introduces an object-centric recovery framework that enhances visuomotor imitation learning by guiding policies back to training distribution in out-of-distribution scenarios, improving robustness without extra data collection.

Contribution

It presents a novel object-centric recovery policy based on inverse policy inference from object keypoints, which can be added to existing policies to handle OOD situations.

Findings

01

77.7% improvement over base policy in OOD scenarios

02

Effective in both simulation and real robot experiments

03

Enables autonomous demonstration collection for continual learning

Abstract

We propose an object-centric recovery (OCR) framework to address the challenges of out-of-distribution (OOD) scenarios in visuomotor policy learning. Previous behavior cloning (BC) methods rely heavily on a large amount of labeled data coverage, failing in unfamiliar spatial states. Without relying on extra data collection, our approach learns a recovery policy constructed by an inverse policy inferred from the object keypoint manifold gradient in the original training data. The recovery policy serves as a simple add-on to any base visuomotor BC policy, agnostic to a specific method, guiding the system back towards the training distribution to ensure task success even in OOD situations. We demonstrate the effectiveness of our object-centric framework in both simulation and real robot experiments, achieving an improvement of 77.7\% over the base policy in OOD. Furthermore, we show OCR's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robot Manipulation and Learning · Human Pose and Action Recognition

MethodsBalanced Selection