CURLing the Dream: Contrastive Representations for World Modeling in   Reinforcement Learning

Victor Augusto Kich; Jair Augusto Bottega; Raul Steinmetz; Ricardo; Bedin Grando; Ayano Yorozu; Akihisa Ohya

arXiv:2408.05781·cs.LG·September 4, 2024

CURLing the Dream: Contrastive Representations for World Modeling in Reinforcement Learning

Victor Augusto Kich, Jair Augusto Bottega, Raul Steinmetz, Ricardo, Bedin Grando, Ayano Yorozu, Akihisa Ohya

PDF

Open Access

TL;DR

Curled-Dreamer is a new reinforcement learning algorithm that combines contrastive learning with DreamerV3, leading to improved performance, faster learning, and more robust policies in visual tasks.

Contribution

This work introduces Curled-Dreamer, integrating contrastive loss into DreamerV3, which is a novel combination enhancing learning efficiency and robustness in visual reinforcement learning.

Findings

01

Outperforms state-of-the-art algorithms on DeepMind Control Suite tasks.

02

Achieves higher mean and median scores across diverse tasks.

03

Accelerates learning and improves policy robustness.

Abstract

In this work, we present Curled-Dreamer, a novel reinforcement learning algorithm that integrates contrastive learning into the DreamerV3 framework to enhance performance in visual reinforcement learning tasks. By incorporating the contrastive loss from the CURL algorithm and a reconstruction loss from autoencoder, Curled-Dreamer achieves significant improvements in various DeepMind Control Suite tasks. Our extensive experiments demonstrate that Curled-Dreamer consistently outperforms state-of-the-art algorithms, achieving higher mean and median scores across a diverse set of tasks. The results indicate that the proposed approach not only accelerates learning but also enhances the robustness of the learned policies. This work highlights the potential of combining different learning paradigms to achieve superior performance in reinforcement learning applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training · Contrastive Learning