Improving and Benchmarking Offline Reinforcement Learning Algorithms
Bingyi Kang, Xiao Ma, Yirui Wang, Yang Yue, Shuicheng Yan

TL;DR
This paper investigates the impact of low-level implementation choices and datasets on offline RL performance, providing a comprehensive benchmark and a guidebook that lead to new state-of-the-art results.
Contribution
It empirically analyzes 20 implementation choices across three algorithms, introduces a guidebook for implementation, and benchmarks multiple algorithms under a unified framework.
Findings
Implementation choices significantly affect performance.
Data distribution influences learning success.
Previous conclusions may be dataset-biased.
Abstract
Recently, Offline Reinforcement Learning (RL) has achieved remarkable progress with the emergence of various algorithms and datasets. However, these methods usually focus on algorithmic advancements, ignoring that many low-level implementation choices considerably influence or even drive the final performance. As a result, it becomes hard to attribute the progress in Offline RL as these choices are not sufficiently discussed and aligned in the literature. In addition, papers focusing on a dataset (e.g., D4RL) often ignore algorithms proposed on another dataset (e.g., RL Unplugged), causing isolation among the algorithms, which might slow down the overall progress. Therefore, this work aims to bridge the gaps caused by low-level choices and datasets. To this end, we empirically investigate 20 implementation choices using three representative algorithms (i.e., CQL, CRR, and IQL) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Viral Infectious Diseases and Gene Expression in Insects
MethodsFocus
