Learning Category-Level Generalizable Object Manipulation Policy via   Generative Adversarial Self-Imitation Learning from Demonstrations

Hao Shen; Weikang Wan; He Wang

arXiv:2203.02107·cs.RO·September 14, 2022

Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations

Hao Shen, Weikang Wan, He Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel imitation learning approach using generative adversarial self-imitation to enable robots to learn generalizable object manipulation policies across diverse categories without dense rewards.

Contribution

It proposes a new generative adversarial self-imitation learning framework with techniques like progressive discriminator growing and instance balancing, improving generalization in manipulation tasks.

Findings

01

Significant performance improvements on ManiSkill benchmarks.

02

Effective handling of unseen object instances.

03

Validation of each technique's contribution through ablation studies.

Abstract

Generalizable object manipulation skills are critical for intelligent and multi-functional robots to work in real-world complex scenes. Despite the recent progress in reinforcement learning, it is still very challenging to learn a generalizable manipulation policy that can handle a category of geometrically diverse articulated objects. In this work, we tackle this category-level object manipulation policy learning problem via imitation learning in a task-agnostic manner, where we assume no handcrafted dense rewards but only a terminal reward. Given this novel and challenging generalizable policy learning problem, we identify several key issues that can fail the previous imitation learning algorithms and hinder the generalization to unseen instances. We then propose several general but critical techniques, including generative adversarial self-imitation learning from demonstrations,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications