One-Shot Imitation Learning with Invariance Matching for Robotic Manipulation
Xinyu Zhang, Abdeslam Boularias

TL;DR
The paper introduces IMOP, a novel one-shot imitation learning algorithm for robotic manipulation that learns invariant state regions, enabling rapid generalization to new tasks and objects without fine-tuning.
Contribution
IMOP is the first method to learn invariant state regions for one-shot imitation, outperforming existing approaches on multiple manipulation tasks and enabling zero-shot generalization.
Findings
IMOP achieves 4.5% higher success rate on RLBench tasks.
IMOP learns new tasks from a single demonstration without fine-tuning.
IMOP generalizes to new shapes and sim-to-real transfer with one demonstration.
Abstract
Learning a single universal policy that can perform a diverse set of manipulation tasks is a promising new direction in robotics. However, existing techniques are limited to learning policies that can only perform tasks that are encountered during training, and require a large number of demonstrations to learn new tasks. Humans, on the other hand, often can learn a new task from a single unannotated demonstration. In this work, we propose the Invariance-Matching One-shot Policy Learning (IMOP) algorithm. In contrast to the standard practice of learning the end-effector's pose directly, IMOP first learns invariant regions of the state space for a given task, and then computes the end-effector's pose through matching the invariant regions between demonstrations and test scenes. Trained on the 18 RLBench tasks, IMOP achieves a success rate that outperforms the state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Image Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning
