Cross Domain Robot Imitation with Invariant Representation
Zhao-Heng Yin, Lingfeng Sun, Hengbo Ma, Masayoshi Tomizuka, Wu-Jun Li

TL;DR
This paper introduces an invariant representation-based imitation learning algorithm that enables cross domain robot imitation without human-labeled data, successfully generalizing to new robots in simulation.
Contribution
The paper proposes a novel invariant representation learning method for cross domain imitation learning that does not require pairwise labels and leverages cycle-consistency and domain confusion.
Findings
Successfully generalizes to unseen robots in simulation
Learns similar representations for different robots with similar behaviors
Does not require human-labeled pairwise data
Abstract
Animals are able to imitate each others' behavior, despite their difference in biomechanics. In contrast, imitating the other similar robots is a much more challenging task in robotics. This problem is called cross domain imitation learning~(CDIL). In this paper, we consider CDIL on a class of similar robots. We tackle this problem by introducing an imitation learning algorithm based on invariant representation. We propose to learn invariant state and action representations, which aligns the behavior of multiple robots so that CDIL becomes possible. Compared with previous invariant representation learning methods for similar purpose, our method does not require human-labeled pairwise data for training. Instead, we use cycle-consistency and domain confusion to align the representation and increase its robustness. We test the algorithm on multiple robots in simulator and show that unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Domain Adaptation and Few-Shot Learning
