Zero-Shot Transfer in Imitation Learning

Alvaro Cauderan; Gauthier Boeshertz; Florian Schwarb; Calvin Zhang

arXiv:2310.06710·cs.LG·October 11, 2023

Zero-Shot Transfer in Imitation Learning

Alvaro Cauderan, Gauthier Boeshertz, Florian Schwarb, Calvin Zhang

PDF

Open Access

TL;DR

This paper introduces a zero-shot transfer algorithm for imitation learning that leverages disentangled representations and a single Q-function, enabling domain transfer without retraining, which is crucial for real-world robotic applications.

Contribution

The paper proposes a novel imitation learning method combining AnnealedVAE for disentangled states and a single Q-function for transfer, avoiding adversarial training.

Findings

01

Effective transfer across three diverse environments

02

Avoids retraining in new domains

03

Utilizes disentangled representations for robust transfer

Abstract

We present an algorithm that learns to imitate expert behavior and can transfer to previously unseen domains without retraining. Such an algorithm is extremely relevant in real-world applications such as robotic learning because 1) reward functions are difficult to design, 2) learned policies from one domain are difficult to deploy in another domain and 3) learning directly in the real world is either expensive or unfeasible due to security concerns. To overcome these constraints, we combine recent advances in Deep RL by using an AnnealedVAE to learn a disentangled state representation and imitate an expert by learning a single Q-function which avoids adversarial training. We demonstrate the effectiveness of our method in 3 environments ranging in difficulty and the type of transfer knowledge required.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics