A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
Ali Murtaza Caunhye, Asad Jeewa

TL;DR
This paper compares Decision Transformers and traditional offline RL algorithms, analyzing their performance across different reward structures and data qualities in the ANT environment, highlighting their respective strengths and limitations.
Contribution
It provides a comprehensive empirical comparison of Decision Transformers and traditional offline RL methods in various reward and data quality settings.
Findings
DTs are less sensitive to reward density variations.
DTs excel in sparse reward scenarios with medium-expert data.
Traditional methods perform better in dense reward environments.
Abstract
The field of Offline Reinforcement Learning (RL) aims to derive effective policies from pre-collected datasets without active environment interaction. While traditional offline RL algorithms like Conservative Q-Learning (CQL) and Implicit Q-Learning (IQL) have shown promise, they often face challenges in balancing exploration and exploitation, especially in environments with varying reward densities. The recently proposed Decision Transformer (DT) approach, which reframes offline RL as a sequence modelling problem, has demonstrated impressive results across various benchmarks. This paper presents a comparative study evaluating the performance of DT against traditional offline RL algorithms in dense and sparse reward settings for the ANT continous control environment. Our research investigates how these algorithms perform when faced with different reward structures, examining their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques
