Predictive Coding for Decision Transformer
Tung M. Luu, Donghoon Lee, and Chang D. Yoo

TL;DR
This paper introduces PCDT, a novel framework that enhances decision transformers in offline RL by incorporating predictive coding, leading to better generalization and performance on complex goal-conditioned tasks.
Contribution
The paper proposes a generalized future conditioning approach for decision transformers, improving their ability to handle unstructured and long-horizon RL datasets.
Findings
PCDT outperforms existing methods on AntMaze and FrankaKitchen datasets.
It achieves comparable or better results than popular value-based and transformer-based methods.
The approach is effective in a physical robot goal-reaching task.
Abstract
Recent work in offline reinforcement learning (RL) has demonstrated the effectiveness of formulating decision-making as return-conditioned supervised learning. Notably, the decision transformer (DT) architecture has shown promise across various domains. However, despite its initial success, DTs have underperformed on several challenging datasets in goal-conditioned RL. This limitation stems from the inefficiency of return conditioning for guiding policy learning, particularly in unstructured and suboptimal datasets, resulting in DTs failing to effectively learn temporal compositionality. Moreover, this problem might be further exacerbated in long-horizon sparse-reward tasks. To address this challenge, we propose the Predictive Coding for Decision Transformer (PCDT) framework, which leverages generalized future conditioning to enhance DT methods. PCDT utilizes an architecture that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Attention Is All You Need · Dropout · Byte Pair Encoding · Absolute Position Encodings
