Bridging Imagination and Reality for Model-Based Deep Reinforcement   Learning

Guangxiang Zhu; Minghao Zhang; Honglak Lee; Chongjie Zhang

arXiv:2010.12142·cs.LG·October 26, 2020·6 cites

Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning

Guangxiang Zhu, Minghao Zhang, Honglak Lee, Chongjie Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces BIRD, a model-based reinforcement learning algorithm that enhances sample efficiency by aligning imaginary and real trajectories through mutual information maximization, leading to improved performance on visual control tasks.

Contribution

The paper proposes a novel method that bridges imaginary and real trajectories in model-based RL, addressing overfitting and improving generalization.

Findings

01

Improves sample efficiency in model-based planning.

02

Achieves state-of-the-art results on visual control benchmarks.

03

Effectively generalizes policy from imaginary to real trajectories.

Abstract

Sample efficiency has been one of the major challenges for deep reinforcement learning. Recently, model-based reinforcement learning has been proposed to address this challenge by performing planning on imaginary trajectories with a learned world model. However, world model learning may suffer from overfitting to training trajectories, and thus model-based value estimation and policy search will be pone to be sucked in an inferior local policy. In this paper, we propose a novel model-based reinforcement learning algorithm, called BrIdging Reality and Dream (BIRD). It maximizes the mutual information between imaginary and real trajectories so that the policy improvement learned from imaginary trajectories can be easily generalized to real trajectories. We demonstrate that our approach improves sample efficiency of model-based planning, and achieves state-of-the-art performance on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Mehooz/BIRD_code
tfOfficial

Videos

Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Artificial Intelligence in Games