GTA: Generative Trajectory Augmentation with Guidance for Offline   Reinforcement Learning

Jaewoo Lee; Sujin Yun; Taeyoung Yun; Jinkyoo Park

arXiv:2405.16907·cs.AI·November 8, 2024

GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning

Jaewoo Lee, Sujin Yun, Taeyoung Yun, Jinkyoo Park

PDF

Open Access 1 Repo

TL;DR

GTA introduces a diffusion-based generative data augmentation method that enhances offline RL by creating high-reward, plausible trajectories, leading to improved policy performance and data quality across diverse tasks.

Contribution

The paper presents GTA, a novel diffusion model-based trajectory augmentation technique that directly improves offline RL data quality and policy outcomes.

Findings

01

GTA improves offline RL performance across various tasks.

02

GTA enhances the quality of augmented data.

03

GTA outperforms existing augmentation methods.

Abstract

Offline Reinforcement Learning (Offline RL) presents challenges of learning effective decision-making policies from static datasets without any online interactions. Data augmentation techniques, such as noise injection and data synthesizing, aim to improve Q-function approximation by smoothing the learned state-action region. However, these methods often fall short of directly improving the quality of offline datasets, leading to suboptimal results. In response, we introduce GTA, Generative Trajectory Augmentation, a novel generative data augmentation approach designed to enrich offline data by augmenting trajectories to be both high-rewarding and dynamically plausible. GTA applies a diffusion model within the data augmentation framework. GTA partially noises original trajectories and then denoises them with classifier-free guidance via conditioning on amplified return value. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jaewoopudding/gta
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Robot Manipulation and Learning

MethodsDiffusion