Imagination Policy: Using Generative Point Cloud Models for Learning   Manipulation Policies

Haojie Huang; Karl Schmeckpeper; Dian Wang; Ondrej Biza; Yaoyao Qian,; Haotian Liu; Mingxi Jia; Robert Platt; Robin Walters

arXiv:2406.11740·cs.RO·December 3, 2024

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Haojie Huang, Karl Schmeckpeper, Dian Wang, Ondrej Biza, Yaoyao Qian,, Haotian Liu, Mingxi Jia, Robert Platt, Robin Walters

PDF

Open Access

TL;DR

This paper introduces Imagination Policy, a novel approach that uses generative point cloud models to imagine goal states for high-precision manipulation, achieving high efficiency and generalization in robotic pick-and-place tasks.

Contribution

It presents a new multi-task key-frame policy network that generates point clouds to imagine goal states, transforming action inference into a local generative task for robotic manipulation.

Findings

01

Achieves state-of-the-art performance on RLbench benchmark

02

Demonstrates high sample efficiency and generalization

03

Validates effectiveness on real robot experiments

Abstract

Humans can imagine goal states during planning and perform actions to match those goals. In this work, we propose Imagination Policy, a novel multi-task key-frame policy network for solving high-precision pick and place tasks. Instead of learning actions directly, Imagination Policy generates point clouds to imagine desired states which are then translated to actions using rigid action estimation. This transforms action inference into a local generative task. We leverage pick and place symmetries underlying the tasks in the generation process and achieve extremely high sample efficiency and generalizability to unseen configurations. Finally, we demonstrate state-of-the-art performance across various tasks on the RLbench benchmark compared with several strong baselines and validate our approach on a real robot.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Numerical Analysis Techniques