DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction
Masashi Okada, Tadahiro Taniguchi

TL;DR
DreamingV2 introduces a novel reinforcement learning approach combining discrete world models with a reconstruction-free learning objective, achieving superior performance on complex 3D robot tasks without autoencoding.
Contribution
It integrates discrete latent representations with a reconstruction-free contrastive learning objective in reinforcement learning, advancing model-based methods from pixels.
Findings
Achieves top scores on five challenging 3D robot arm tasks.
Utilizes discrete representations suitable for discontinuous environments.
Employs a reconstruction-free approach to handle complex visual observations.
Abstract
The present paper proposes a novel reinforcement learning method with world models, DreamingV2, a collaborative extension of DreamerV2 and Dreaming. DreamerV2 is a cutting-edge model-based reinforcement learning from pixels that uses discrete world models to represent latent states with categorical variables. Dreaming is also a form of reinforcement learning from pixels that attempts to avoid the autoencoding process in general world model training by involving a reconstruction-free contrastive learning objective. The proposed DreamingV2 is a novel approach of adopting both the discrete representation of DreamingV2 and the reconstruction-free objective of Dreaming. Compared to DreamerV2 and other recent model-based methods without reconstruction, DreamingV2 achieves the best scores on five simulated challenging 3D robot arm tasks. We believe that DreamingV2 will be a reliable solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsContrastive Learning
