ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts
Hang Yu, Di Zhang, Qiwei Du, Yanping Zhao, Hai Zhang, Guang Chen, Eduardo E. Veas, Junqiao Zhao

TL;DR
ASTRO introduces a novel data augmentation framework for offline reinforcement learning that generates dynamics-consistent trajectories through adaptive stitching, significantly improving policy performance on standard benchmarks.
Contribution
The paper presents ASTRO, a new method that adaptively stitches trajectories using dynamics-guided planning, addressing limitations of existing augmentation techniques in offline RL.
Findings
ASTRO outperforms prior augmentation methods on OGBench and D4RL benchmarks.
The method achieves significant policy performance improvements.
ASTRO effectively generates feasible, dynamics-consistent trajectories for offline RL.
Abstract
Offline reinforcement learning (RL) enables agents to learn optimal policies from pre-collected datasets. However, datasets containing suboptimal and fragmented trajectories present challenges for reward propagation, resulting in inaccurate value estimation and degraded policy performance. While trajectory stitching via generative models offers a promising solution, existing augmentation methods frequently produce trajectories that are either confined to the support of the behavior policy or violate the underlying dynamics, thereby limiting their effectiveness for policy improvement. We propose ASTRO, a data augmentation framework that generates distributionally novel and dynamics-consistent trajectories for offline RL. ASTRO first learns a temporal-distance representation to identify distinct and reachable stitch targets. We then employ a dynamics-guided stitch planner that adaptively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis · Autonomous Vehicle Technology and Safety
