Enhancing Decision Transformer with Diffusion-Based Trajectory Branch   Generation

Zhihong Liu; Long Qian; Zeyang Liu; Lipeng Wan; Xingyu Chen; Xuguang; Lan

arXiv:2411.11327·cs.LG·November 19, 2024

Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation

Zhihong Liu, Long Qian, Zeyang Liu, Lipeng Wan, Xingyu Chen, Xuguang, Lan

PDF

Open Access

TL;DR

This paper introduces Diffusion-Based Trajectory Branch Generation to enhance Decision Transformer by expanding trajectories with diffusion-generated branches, enabling better policy learning and outperforming state-of-the-art methods on benchmarks.

Contribution

It proposes a novel diffusion-based method to expand offline RL datasets with trajectory branches, improving policy learning without modifying existing models.

Findings

01

DT with BG outperforms state-of-the-art methods on D4RL benchmarks.

02

Trajectory expansion with diffusion improves policy convergence.

03

The method effectively prevents convergence to sub-optimal trajectories.

Abstract

Decision Transformer (DT) can learn effective policy from offline datasets by converting the offline reinforcement learning (RL) into a supervised sequence modeling task, where the trajectory elements are generated auto-regressively conditioned on the return-to-go (RTG).However, the sequence modeling learning approach tends to learn policies that converge on the sub-optimal trajectories within the dataset, for lack of bridging data to move to better trajectories, even if the condition is set to the highest RTG.To address this issue, we introduce Diffusion-Based Trajectory Branch Generation (BG), which expands the trajectories of the dataset with branches generated by a diffusion model.The trajectory branch is generated based on the segment of the trajectory within the dataset, and leads to trajectories with higher returns.We concatenate the generated branch with the trajectory segment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Robotic Path Planning Algorithms · AI-based Problem Solving and Planning

MethodsAttention Is All You Need · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Sparse Evolutionary Training · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Label Smoothing