Goal-Conditioned Data Augmentation for Offline Reinforcement Learning

Xingshuai Huang; Di Wu; and Benoit Boulet

arXiv:2412.20519·cs.LG·September 3, 2025

Goal-Conditioned Data Augmentation for Offline Reinforcement Learning

Xingshuai Huang, Di Wu, and Benoit Boulet

PDF

Open Access

TL;DR

This paper introduces GODA, a goal-conditioned diffusion-based data augmentation method that improves offline reinforcement learning by generating higher-quality, goal-oriented samples to overcome suboptimal datasets.

Contribution

GODA is a novel goal-conditioned diffusion approach with return-based guidance and adaptive gating, enhancing data quality for offline RL from limited demonstrations.

Findings

01

GODA outperforms state-of-the-art augmentation methods on D4RL benchmarks.

02

GODA improves policy performance in traffic signal control tasks.

03

The method effectively generates higher-return samples from suboptimal datasets.

Abstract

Offline reinforcement learning (RL) enables policy learning from pre-collected offline datasets, relaxing the need to interact directly with the environment. However, limited by the quality of offline datasets, it generally fails to learn well-qualified policies in suboptimal datasets. To address datasets with insufficient optimal demonstrations, we introduce Goal-cOnditioned Data Augmentation (GODA), a novel goal-conditioned diffusion-based method for augmenting samples with higher quality. Leveraging recent advancements in generative modelling, GODA incorporates a novel return-oriented goal condition with various selection mechanisms. Specifically, we introduce a controllable scaling technique to provide enhanced return-based guidance during data sampling. GODA learns a comprehensive distribution representation of the original offline datasets while generating new data with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management