Dream to Control: Learning Behaviors by Latent Imagination

Danijar Hafner; Timothy Lillicrap; Jimmy Ba; Mohammad Norouzi

arXiv:1912.01603·cs.LG·March 18, 2020·137 cites

Dream to Control: Learning Behaviors by Latent Imagination

Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi

PDF

Open Access 5 Repos 1 Models 1 Video

TL;DR

Dreamer is a reinforcement learning agent that uses latent imagination within learned world models to efficiently solve complex visual tasks, outperforming existing methods in multiple metrics.

Contribution

The paper introduces Dreamer, a novel approach that leverages latent space imagination and analytic gradients for efficient long-horizon reinforcement learning from images.

Findings

01

Outperforms existing methods in data-efficiency

02

Achieves higher final performance on visual control tasks

03

Reduces computation time significantly

Abstract

Learned world models summarize an agent's experience to facilitate learning complex behaviors. While learning world models from high-dimensional sensory inputs is becoming feasible through deep learning, there are many potential ways for deriving behaviors from them. We present Dreamer, a reinforcement learning agent that solves long-horizon tasks from images purely by latent imagination. We efficiently learn behaviors by propagating analytic gradients of learned state values back through trajectories imagined in the compact state space of a learned world model. On 20 challenging visual control tasks, Dreamer exceeds existing approaches in data-efficiency, computation time, and final performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
minhphd/PyDreamerV1
model

Videos

Dream to Control: Learning Behaviors by Latent Imagination· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications