State-Covering Trajectory Stitching for Diffusion Planners
Kyowoon Lee, Jaesik Choi

TL;DR
This paper introduces SCoTS, a novel trajectory augmentation method that stitches short segments to generate diverse, extended trajectories, enhancing diffusion planners' performance and generalization in offline RL tasks.
Contribution
SCoTS is a reward-free, latent space-based trajectory stitching method that improves long-horizon planning and generalization in diffusion models for offline reinforcement learning.
Findings
SCoTS significantly improves diffusion planner performance on offline benchmarks.
Augmented trajectories enhance offline RL algorithms across various environments.
SCoTS effectively covers and expands the environment's latent space.
Abstract
Diffusion-based generative models are emerging as powerful tools for long-horizon planning in reinforcement learning (RL), particularly with offline datasets. However, their performance is fundamentally limited by the quality and diversity of training data. This often restricts their generalization to tasks outside their training distribution or longer planning horizons. To overcome this challenge, we propose State-Covering Trajectory Stitching (SCoTS), a novel reward-free trajectory augmentation method that incrementally stitches together short trajectory segments, systematically generating diverse and extended trajectories. SCoTS first learns a temporal distance-preserving latent representation that captures the underlying temporal structure of the environment, then iteratively stitches trajectory segments guided by directional exploration and novelty to effectively cover and expand…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
