Beyond Autoregressive RTG: Conditioning via Injection Outside Sequential Modeling in Decision Transformer

Yongyi Wang; Hanyu Liu; Lingfeng Li; Bozhou Chen; Ang Li; Qirui Zheng; Xionghui Yang; Chucai Wang; Wenxin Li

arXiv:2605.06104·cs.LG·May 8, 2026

Beyond Autoregressive RTG: Conditioning via Injection Outside Sequential Modeling in Decision Transformer

Yongyi Wang, Hanyu Liu, Lingfeng Li, Bozhou Chen, Ang Li, Qirui Zheng, Xionghui Yang, Chucai Wang, Wenxin Li

PDF

TL;DR

SlimDT improves offline reinforcement learning by injecting RTG information into state representations, reducing sequence length and computational cost while enhancing performance over standard Decision Transformer.

Contribution

Proposes removing RTG from autoregressive sequences and injecting it into state representations, leading to efficiency gains and better task performance.

Findings

01

SlimDT reduces sequence length by one-third.

02

SlimDT outperforms standard Decision Transformer on D4RL tasks.

03

Decoupling RTG improves both efficiency and effectiveness.

Abstract

Decision Transformer (DT) formulates offline reinforcement learning as autoregressive sequence modeling, achieving promising results by predicting actions from a sequence of Return-to-Go (RTG), state, and action tokens. However, RTG is a scalar that summarizes future rewards, containing far less information than typical state or action vectors, yet it consumes the same computational budget per token. Worse, the self-attention cost of Transformers grows quadratically with sequence length, so including RTG as a separate token adds unnecessary overhead. We propose SlimDT, which removes RTG from the autoregressive sequence. Instead, we inject RTG information into the state representations before the sequential modeling step, allowing the Transformer to process only a compact (state, action) sequence. This reduces the sequence length by one-third, directly improving inference efficiency. On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.