DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Irina Higgins; Arka Pal; Andrei A. Rusu; Loic Matthey; Christopher P; Burgess; Alexander Pritzel; Matthew Botvinick; Charles Blundell; Alexander; Lerchner

arXiv:1707.08475·stat.ML·June 7, 2018·139 cites

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Irina Higgins, Arka Pal, Andrei A. Rusu, Loic Matthey, Christopher P, Burgess, Alexander Pritzel, Matthew Botvinick, Charles Blundell, Alexander, Lerchner

PDF

Open Access 1 Repo

TL;DR

DARLA introduces a multi-stage reinforcement learning agent that learns disentangled visual representations to enable robust zero-shot transfer across diverse domains without target domain data.

Contribution

It proposes DARLA, a novel RL agent that learns to see before acting, improving zero-shot transfer by disentangling environment representations.

Findings

01

DARLA outperforms baselines in zero-shot domain adaptation.

02

Effective across multiple RL environments and algorithms.

03

Disentangled representations enhance robustness to domain shifts.

Abstract

Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain. We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before learning to act. DARLA's vision is based on learning a disentangled representation of the observed environment. Once DARLA can see, it is able to acquire source policies that are robust to many domain shifts - even with no access to the target domain. DARLA significantly outperforms conventional baselines in zero-shot domain adaptation scenarios, an effect that holds across a variety of RL environments (Jaco arm, DeepMind Lab) and base RL algorithms (DQN, A3C and EC).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BCHoagland/DARLA
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adaptive Dynamic Programming Control

MethodsEntropy Regularization · Dense Connections · Softmax · Convolution · A3C