Goal-Conditioned Data Augmentation for Offline Reinforcement Learning
Xingshuai Huang, Di Wu, and Benoit Boulet

TL;DR
This paper introduces GODA, a goal-conditioned diffusion-based data augmentation method that improves offline reinforcement learning by generating higher-quality, goal-oriented samples to overcome suboptimal datasets.
Contribution
GODA is a novel goal-conditioned diffusion approach with return-based guidance and adaptive gating, enhancing data quality for offline RL from limited demonstrations.
Findings
GODA outperforms state-of-the-art augmentation methods on D4RL benchmarks.
GODA improves policy performance in traffic signal control tasks.
The method effectively generates higher-return samples from suboptimal datasets.
Abstract
Offline reinforcement learning (RL) enables policy learning from pre-collected offline datasets, relaxing the need to interact directly with the environment. However, limited by the quality of offline datasets, it generally fails to learn well-qualified policies in suboptimal datasets. To address datasets with insufficient optimal demonstrations, we introduce Goal-cOnditioned Data Augmentation (GODA), a novel goal-conditioned diffusion-based method for augmenting samples with higher quality. Leveraging recent advancements in generative modelling, GODA incorporates a novel return-oriented goal condition with various selection mechanisms. Specifically, we introduce a controllable scaling technique to provide enhanced return-based guidance during data sampling. GODA learns a comprehensive distribution representation of the original offline datasets while generating new data with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management
