Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

Pawe{\l} Gajewski; Dominik \.Zurek; Marcin Pietro\'n; Kamil Faber

arXiv:2410.06347·cs.RO·May 11, 2026

Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

Pawe{\l} Gajewski, Dominik \.Zurek, Marcin Pietro\'n, Kamil Faber

PDF

TL;DR

This paper presents a goal-conditioned decision transformer tailored for offline multi-goal robotics, improving task performance and robustness using pre-collected data.

Contribution

It introduces a novel transformer-based framework that explicitly incorporates goal states for efficient multi-goal offline reinforcement learning in robotics.

Findings

01

Outperforms state-of-the-art online methods on complex tasks

02

Maintains robustness in sparse-reward scenarios

03

Effective with limited expert demonstrations

Abstract

Reinforcement learning (RL) in robotics faces significant hurdles regarding sample efficiency and generalization across varying goals. While Offline RL mitigates the need for costly online interactions, its integration with goal-conditioned policies and transformer-based architectures remains underexplored. We introduce a Goal-Conditioned Decision Transformer adapted for offline multi-goal robotics. By explicitly incorporating goal states into the sequence modeling framework, our approach efficiently solves varying tasks using only pre-collected data. We validate this method on a newly released offline dataset for the Franka Emika Panda platform. Experimental results demonstrate that our approach outperforms state-of-the-art online baselines in complex tasks and maintains robustness in sparse-reward settings, even with limited expert demonstrations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.