Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu, Long Qian, Zeyang Liu, Lipeng Wan, Xingyu Chen, Xuguang, Lan

TL;DR
This paper introduces Diffusion-Based Trajectory Branch Generation to enhance Decision Transformer by expanding trajectories with diffusion-generated branches, enabling better policy learning and outperforming state-of-the-art methods on benchmarks.
Contribution
It proposes a novel diffusion-based method to expand offline RL datasets with trajectory branches, improving policy learning without modifying existing models.
Findings
DT with BG outperforms state-of-the-art methods on D4RL benchmarks.
Trajectory expansion with diffusion improves policy convergence.
The method effectively prevents convergence to sub-optimal trajectories.
Abstract
Decision Transformer (DT) can learn effective policy from offline datasets by converting the offline reinforcement learning (RL) into a supervised sequence modeling task, where the trajectory elements are generated auto-regressively conditioned on the return-to-go (RTG).However, the sequence modeling learning approach tends to learn policies that converge on the sub-optimal trajectories within the dataset, for lack of bridging data to move to better trajectories, even if the condition is set to the highest RTG.To address this issue, we introduce Diffusion-Based Trajectory Branch Generation (BG), which expands the trajectories of the dataset with branches generated by a diffusion model.The trajectory branch is generated based on the segment of the trajectory within the dataset, and leads to trajectories with higher returns.We concatenate the generated branch with the trajectory segment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Robotic Path Planning Algorithms · AI-based Problem Solving and Planning
MethodsAttention Is All You Need · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Sparse Evolutionary Training · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Label Smoothing
