Off-dynamics Conditional Diffusion Planners
Wen Zheng Terence Ng, Jianda Chen, Tianwei Zhang

TL;DR
This paper introduces a novel conditional diffusion model approach for offline reinforcement learning that effectively leverages off-dynamics datasets, improving data efficiency and robustness to environment shifts.
Contribution
The work proposes a new conditional diffusion probabilistic model with two dynamics contexts to better utilize off-dynamics data in offline RL, outperforming existing baselines.
Findings
Significant performance improvements over strong baselines
Critical role of dynamics contexts demonstrated through ablation studies
Ability to interpolate between source and target dynamics for robustness
Abstract
Offline Reinforcement Learning (RL) offers an attractive alternative to interactive data acquisition by leveraging pre-existing datasets. However, its effectiveness hinges on the quantity and quality of the data samples. This work explores the use of more readily available, albeit off-dynamics datasets, to address the challenge of data scarcity in Offline RL. We propose a novel approach using conditional Diffusion Probabilistic Models (DPMs) to learn the joint distribution of the large-scale off-dynamics dataset and the limited target dataset. To enable the model to capture the underlying dynamics structure, we introduce two contexts for the conditional model: (1) a continuous dynamics score allows for partial overlap between trajectories from both datasets, providing the model with richer information; (2) an inverse-dynamics context guides the model to generate trajectories that adhere…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDynamics and Control of Mechanical Systems · Robotic Path Planning Algorithms · Computational Geometry and Mesh Generation
MethodsDiffusion
