DreamingV2: Reinforcement Learning with Discrete World Models without   Reconstruction

Masashi Okada; Tadahiro Taniguchi

arXiv:2203.00494·cs.LG·March 2, 2022

DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction

Masashi Okada, Tadahiro Taniguchi

PDF

Open Access

TL;DR

DreamingV2 introduces a novel reinforcement learning approach combining discrete world models with a reconstruction-free learning objective, achieving superior performance on complex 3D robot tasks without autoencoding.

Contribution

It integrates discrete latent representations with a reconstruction-free contrastive learning objective in reinforcement learning, advancing model-based methods from pixels.

Findings

01

Achieves top scores on five challenging 3D robot arm tasks.

02

Utilizes discrete representations suitable for discontinuous environments.

03

Employs a reconstruction-free approach to handle complex visual observations.

Abstract

The present paper proposes a novel reinforcement learning method with world models, DreamingV2, a collaborative extension of DreamerV2 and Dreaming. DreamerV2 is a cutting-edge model-based reinforcement learning from pixels that uses discrete world models to represent latent states with categorical variables. Dreaming is also a form of reinforcement learning from pixels that attempts to avoid the autoencoding process in general world model training by involving a reconstruction-free contrastive learning objective. The proposed DreamingV2 is a novel approach of adopting both the discrete representation of DreamingV2 and the reconstruction-free objective of Dreaming. Compared to DreamerV2 and other recent model-based methods without reconstruction, DreamingV2 achieves the best scores on five simulated challenging 3D robot arm tasks. We believe that DreamingV2 will be a reliable solution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications

MethodsContrastive Learning