MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned   Weighted Supervised Learning

Xing Lei; Xuetao Zhang; Donglin Wang

arXiv:2412.11410·cs.LG·December 23, 2024

MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning

Xing Lei, Xuetao Zhang, Donglin Wang

PDF

Open Access 1 Video

TL;DR

This paper introduces MGDA, a model-based goal data augmentation method that improves goal data sampling for offline goal-conditioned reinforcement learning, enhancing trajectory stitching and overall performance.

Contribution

The paper proposes a novel MGDA approach that uses a learned dynamics model and Lipschitz continuity to improve goal augmentation in GCWSL methods.

Findings

01

MGDA outperforms previous augmentation techniques in maze datasets.

02

MGDA enhances trajectory stitching capabilities.

03

Empirical results confirm improved goal-reaching performance.

Abstract

Recently, a state-of-the-art family of algorithms, known as Goal-Conditioned Weighted Supervised Learning (GCWSL) methods, has been introduced to tackle challenges in offline goal-conditioned reinforcement learning (RL). GCWSL optimizes a lower bound of the goal-conditioned RL objective and has demonstrated outstanding performance across diverse goal-reaching tasks, providing a simple, effective, and stable solution. However, prior research has identified a critical limitation of GCWSL: the lack of trajectory stitching capabilities. To address this, goal data augmentation strategies have been proposed to enhance these methods. Nevertheless, existing techniques often struggle to sample suitable augmented goals for GCWSL effectively. In this paper, we establish unified principles for goal data augmentation, focusing on goal diversity, action optimality, and goal reachability. Based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning· underline

Taxonomy

TopicsEducational Technology and Assessment