Decoupling Return-to-Go for Efficient Decision Transformer

Yongyi Wang; Hanyu Liu; Lingfeng Li; Bozhou Chen; Ang Li; Qirui Zheng; Xionghui Yang; Wenxin Li

arXiv:2601.15953·cs.AI·January 23, 2026

Open Access

TL;DR

This paper introduces Decoupled Decision Transformer (DDT), which simplifies the original DT architecture by using only the latest RTG for action prediction, leading to improved performance and efficiency in offline reinforcement learning.

Contribution

The paper identifies a redundancy in DT's use of RTG sequences and proposes DDT, a streamlined model that enhances performance and reduces computational costs.
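The cost side of this claim can be illustrated with a rough count. The sketch below assumes the standard DT layout of three tokens per timestep (RTG, state, action) and a DDT layout of two (observation, action), and uses the number of query-key pairs in full self-attention as a proxy for cost. The context length `K = 20` and the helper names are illustrative assumptions, not values or code taken from the paper, and the resulting ratio is not a measured speedup.

```python
def tokens_in_context(context_len: int, tokens_per_step: int) -> int:
    """Total tokens fed to the Transformer for a context of `context_len` timesteps."""
    return context_len * tokens_per_step


def attention_pairs(num_tokens: int) -> int:
    """Query-key pairs in full self-attention: a quadratic proxy for attention cost."""
    return num_tokens * num_tokens


K = 20  # hypothetical context length, chosen only for illustration

dt_tokens = tokens_in_context(K, tokens_per_step=3)   # RTG, state, action
ddt_tokens = tokens_in_context(K, tokens_per_step=2)  # observation, action only

ratio = attention_pairs(ddt_tokens) / attention_pairs(dt_tokens)

print(dt_tokens, ddt_tokens)  # 60 40
print(f"{ratio:.2f}")         # 0.44
```

Under these assumptions the ratio is (2K)² / (3K)² = 4/9 regardless of `K`, so dropping the RTG tokens would remove a little over half of the attention pairs. Feed-forward and embedding costs scale linearly with token count, so the overall saving in a real model would differ.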

Findings

1. DDT outperforms the original DT on multiple offline RL tasks.

2. Using only the latest RTG improves Decision Transformer performance.

3. DDT achieves competitive results against state-of-the-art DT variants.

Abstract

The Decision Transformer (DT) has established a powerful sequence modeling approach to offline reinforcement learning. It conditions its action predictions on Return-to-Go (RTG), using it both to distinguish trajectory quality during training and to guide action generation at inference. In this work, we identify a critical redundancy in this design: feeding the entire sequence of RTGs into the Transformer is theoretically unnecessary, as only the most recent RTG affects action prediction. We show through experiments that this redundancy can impair DT's performance. To resolve this, we propose the Decoupled DT (DDT). DDT simplifies the architecture by processing only observation and action sequences through the Transformer and using the latest RTG to guide the action prediction. This streamlined approach not only improves performance but also reduces computational cost. Our experiments show…
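To make the input difference concrete, here is a minimal sketch, assuming RTG is the undiscounted sum of rewards from the current step to the end of the trajectory, as in the original DT. The functions `returns_to_go`, `dt_inputs`, and `ddt_inputs` are hypothetical helpers written for illustration. They show only which quantities enter the Transformer in a training-time context. How DDT injects the latest RTG into the action head is not specified in the abstract, so that step is left out.

```python
from typing import List, Sequence, Tuple

Token = Tuple[str, float]


def returns_to_go(rewards: Sequence[float]) -> List[float]:
    """RTG at step t is the sum of rewards from t to the end of the trajectory."""
    rtg = [0.0] * len(rewards)
    running = 0.0
    for t in range(len(rewards) - 1, -1, -1):
        running += rewards[t]
        rtg[t] = running
    return rtg


def dt_inputs(rtgs: Sequence[float], observations: Sequence[float],
              actions: Sequence[float]) -> List[Token]:
    """DT-style context: interleave (RTG, observation, action) at every timestep."""
    tokens: List[Token] = []
    for g, o, a in zip(rtgs, observations, actions):
        tokens.extend([("rtg", g), ("obs", o), ("act", a)])
    return tokens


def ddt_inputs(rtgs: Sequence[float], observations: Sequence[float],
               actions: Sequence[float]) -> Tuple[List[Token], float]:
    """DDT-style context per the abstract: only observations and actions pass
    through the Transformer; the latest RTG is kept aside to guide prediction."""
    tokens: List[Token] = []
    for o, a in zip(observations, actions):
        tokens.extend([("obs", o), ("act", a)])
    return tokens, rtgs[-1]


# Toy trajectory with scalar observations and actions (illustrative values only).
rewards = [1.0, 0.0, 2.0]
observations = [0.1, 0.2, 0.3]
actions = [0.0, 1.0, 0.0]

rtgs = returns_to_go(rewards)
dt_tokens = dt_inputs(rtgs, observations, actions)
ddt_tokens, latest_rtg = ddt_inputs(rtgs, observations, actions)

print(rtgs)                             # [3.0, 2.0, 2.0]
print(len(dt_tokens), len(ddt_tokens))  # 9 6
print(latest_rtg)                       # 2.0
```

In this toy case the DT context carries nine tokens and the DDT context six, with the single scalar `latest_rtg` handled outside the sequence.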

Taxonomy

Topics: Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)