MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning
Xing Lei, Xuetao Zhang, Donglin Wang

TL;DR
This paper introduces MGDA, a model-based goal data augmentation method that improves goal data sampling for offline goal-conditioned reinforcement learning, enhancing trajectory stitching and overall performance.
Contribution
The paper proposes a novel MGDA approach that uses a learned dynamics model and Lipschitz continuity to improve goal augmentation in GCWSL methods.
Findings
MGDA outperforms previous augmentation techniques in maze datasets.
MGDA enhances trajectory stitching capabilities.
Empirical results confirm improved goal-reaching performance.
Abstract
Recently, a state-of-the-art family of algorithms, known as Goal-Conditioned Weighted Supervised Learning (GCWSL) methods, has been introduced to tackle challenges in offline goal-conditioned reinforcement learning (RL). GCWSL optimizes a lower bound of the goal-conditioned RL objective and has demonstrated outstanding performance across diverse goal-reaching tasks, providing a simple, effective, and stable solution. However, prior research has identified a critical limitation of GCWSL: the lack of trajectory stitching capabilities. To address this, goal data augmentation strategies have been proposed to enhance these methods. Nevertheless, existing techniques often struggle to sample suitable augmented goals for GCWSL effectively. In this paper, we establish unified principles for goal data augmentation, focusing on goal diversity, action optimality, and goal reachability. Based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEducational Technology and Assessment
