Improving and Benchmarking Offline Reinforcement Learning Algorithms

Bingyi Kang; Xiao Ma; Yirui Wang; Yang Yue; Shuicheng Yan

arXiv:2306.00972·cs.LG·June 2, 2023·1 cites

Improving and Benchmarking Offline Reinforcement Learning Algorithms

Bingyi Kang, Xiao Ma, Yirui Wang, Yang Yue, Shuicheng Yan

PDF

Open Access 1 Repo

TL;DR

This paper investigates the impact of low-level implementation choices and datasets on offline RL performance, providing a comprehensive benchmark and a guidebook that lead to new state-of-the-art results.

Contribution

It empirically analyzes 20 implementation choices across three algorithms, introduces a guidebook for implementation, and benchmarks multiple algorithms under a unified framework.

Findings

01

Implementation choices significantly affect performance.

02

Data distribution influences learning success.

03

Previous conclusions may be dataset-biased.

Abstract

Recently, Offline Reinforcement Learning (RL) has achieved remarkable progress with the emergence of various algorithms and datasets. However, these methods usually focus on algorithmic advancements, ignoring that many low-level implementation choices considerably influence or even drive the final performance. As a result, it becomes hard to attribute the progress in Offline RL as these choices are not sufficiently discussed and aligned in the literature. In addition, papers focusing on a dataset (e.g., D4RL) often ignore algorithms proposed on another dataset (e.g., RL Unplugged), causing isolation among the algorithms, which might slow down the overall progress. Therefore, this work aims to bridge the gaps caused by low-level choices and datasets. To this end, we empirically investigate 20 implementation choices using three representative algorithms (i.e., CQL, CRR, and IQL) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sail-sg/offbench
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Viral Infectious Diseases and Gene Expression in Insects

MethodsFocus