Accelerating Goal-Conditioned RL Algorithms and Research

Michał Bortkiewicz; Władysław Pałucki; Vivek Myers; Tadeusz Dziarmaga; Tomasz Arczewski; Łukasz Kuciński; Benjamin Eysenbach

arXiv:2408.11052·cs.LG·November 25, 2025

TL;DR

This paper introduces JaxGCRL, a high-performance benchmark and codebase for self-supervised goal-conditioned reinforcement learning, significantly accelerating training and facilitating research in the field.

Contribution

It provides a GPU-accelerated, stable RL algorithm and benchmark for self-supervised GCRL, enabling rapid training and evaluation of agents in complex environments.

Findings

01 Training is up to 22 times faster

02 Key design choices that stabilize training are identified

03 Agents can be trained for millions of environment steps in minutes

Abstract

Self-supervision has the potential to transform reinforcement learning (RL), paralleling the breakthroughs it has enabled in other areas of machine learning. While self-supervised learning in other domains aims to find patterns in a fixed dataset, self-supervised goal-conditioned reinforcement learning (GCRL) agents discover new behaviors by learning from the goals achieved during unstructured interaction with the environment. However, these methods have failed to see similar success, both due to a lack of data from slow environment simulations as well as a lack of stable algorithms. We take a step toward addressing both of these issues by releasing a high-performance codebase and benchmark (JaxGCRL) for self-supervised GCRL, enabling researchers to train agents for millions of environment steps in minutes on a single GPU. By utilizing GPU-accelerated replay buffers, environments, and a…
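The idea of "learning from the goals achieved" during interaction can be illustrated with a small relabeling sketch. This is a minimal illustration in plain Python, not JaxGCRL's implementation: the function name `relabel_with_future_goals`, the geometric sampling of the future offset, and the `discount` parameter are all assumptions made for the example, since the abstract does not specify how goals are drawn.

```python
import random


def relabel_with_future_goals(states, discount=0.99, rng=None):
    """Pair each state with a goal taken from later in the same trajectory.

    Hypothetical helper: the offset to the future state is drawn from a
    geometric distribution (continue with probability `discount`),
    truncated at the end of the trajectory.
    """
    rng = rng or random.Random(0)
    last = len(states) - 1
    pairs = []
    for t in range(last):
        offset = 1
        # Keep stepping forward while the coin flip succeeds and
        # there is still a later state available.
        while rng.random() < discount and t + offset < last:
            offset += 1
        pairs.append((states[t], states[t + offset]))
    return pairs


# Toy trajectory: the agent moves along the x-axis, one unit per step.
trajectory = [(float(i), 0.0) for i in range(10)]
pairs = relabel_with_future_goals(trajectory, discount=0.9, rng=random.Random(42))
```

Each `(state, goal)` pair treats a state the agent actually reached later as the goal it was "trying" to reach, so no reward function or external goal labels are needed. A GPU-oriented version would perform the same sampling in batch over a replay buffer rather than in a Python loop.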

Code & Models

Repositories

michalbortkiewicz/jaxgcrl (JAX, official)

Taxonomy

Topics: Fuzzy Logic and Control Systems

Methods: Sparse Evolutionary Training · InfoNCE