Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding
Grace Zhang, Linghan Zhong, Youngwoon Lee, Joseph J. Lim

TL;DR
This paper introduces IDAPT, an iterative environment grounding method that effectively transfers policies across visual and dynamics domain gaps, enabling robots to adapt from simulated or lab environments to real-world settings with minimal supervision.
Contribution
The paper presents a novel iterative policy transfer approach that simultaneously addresses visual and dynamics domain gaps through environment grounding and policy training.
Findings
Successfully transfers policies across visual and dynamics gaps
Reduces the need for extensive target environment interaction
Demonstrates effectiveness on locomotion and manipulation tasks
Abstract
The ability to transfer a policy from one environment to another is a promising avenue for efficient robot learning in realistic settings where task supervision is not available. This can allow us to take advantage of environments well suited for training, such as simulators or laboratories, to learn a policy for a real robot in a home or office. To succeed, such policy transfer must overcome both the visual domain gap (e.g. different illumination or background) and the dynamics domain gap (e.g. different robot calibration or modelling error) between source and target environments. However, prior policy transfer approaches either cannot handle a large domain gap or can only address one type of domain gap at a time. In this paper, we propose a novel policy transfer method with iterative "environment grounding", IDAPT, that alternates between (1) directly minimizing both visual and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
