ADAIL: Adaptive Adversarial Imitation Learning

Yiren Lu; Jonathan Tompson

arXiv:2008.12647·cs.LG·August 31, 2020·5 cites

ADAIL: Adaptive Adversarial Imitation Learning

Yiren Lu, Jonathan Tompson

PDF

Open Access

TL;DR

ADAIL introduces an adaptive adversarial imitation learning algorithm that enables policies to transfer across environments with different dynamics by using a dynamics embedding and domain-adversarial training, demonstrated on simulated control tasks.

Contribution

The paper proposes a novel method for adaptive imitation learning that generalizes policies across varying dynamics using adversarial training and dynamics embeddings.

Findings

01

Outperforms recent baselines in simulated control tasks with varying dynamics

02

Learns dynamics-invariant policies effective across multiple environments

03

Demonstrates robustness in transferring learned policies to new dynamics

Abstract

We present the ADaptive Adversarial Imitation Learning (ADAIL) algorithm for learning adaptive policies that can be transferred between environments of varying dynamics, by imitating a small number of demonstrations collected from a single source domain. This is an important problem in robotic learning because in real world scenarios 1) reward functions are hard to obtain, 2) learned policies from one domain are difficult to deploy in another due to varying source to target domain statistics, 3) collecting expert demonstrations in multiple environments where the dynamics are known and controlled is often infeasible. We address these constraints by building upon recent advances in adversarial imitation learning; we condition our policy on a learned dynamics embedding and we employ a domain-adversarial loss to learn a dynamics-invariant discriminator. The effectiveness of our method is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks