Policy Transfer across Visual and Dynamics Domain Gaps via Iterative   Grounding

Grace Zhang; Linghan Zhong; Youngwoon Lee; Joseph J. Lim

arXiv:2107.00339·cs.RO·July 2, 2021

Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding

Grace Zhang, Linghan Zhong, Youngwoon Lee, Joseph J. Lim

PDF

Open Access 1 Repo

TL;DR

This paper introduces IDAPT, an iterative environment grounding method that effectively transfers policies across visual and dynamics domain gaps, enabling robots to adapt from simulated or lab environments to real-world settings with minimal supervision.

Contribution

The paper presents a novel iterative policy transfer approach that simultaneously addresses visual and dynamics domain gaps through environment grounding and policy training.

Findings

01

Successfully transfers policies across visual and dynamics gaps

02

Reduces the need for extensive target environment interaction

03

Demonstrates effectiveness on locomotion and manipulation tasks

Abstract

The ability to transfer a policy from one environment to another is a promising avenue for efficient robot learning in realistic settings where task supervision is not available. This can allow us to take advantage of environments well suited for training, such as simulators or laboratories, to learn a policy for a real robot in a home or office. To succeed, such policy transfer must overcome both the visual domain gap (e.g. different illumination or background) and the dynamics domain gap (e.g. different robot calibration or modelling error) between source and target environments. However, prior policy transfer approaches either cannot handle a large domain gap or can only address one type of domain gap at a time. In this paper, we propose a novel policy transfer method with iterative "environment grounding", IDAPT, that alternates between (1) directly minimizing both visual and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clvrai/idapt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition