D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Justin Fu; Aviral Kumar; Ofir Nachum; George Tucker; Sergey Levine

arXiv:2004.07219·cs.LG·February 9, 2021·332 cites

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

PDF

Open Access 5 Repos 2 Datasets 1 Video

TL;DR

This paper introduces new offline RL benchmarks with diverse datasets to better evaluate and advance offline reinforcement learning algorithms, addressing limitations of previous benchmarks.

Contribution

The authors present a set of offline RL benchmarks with varied datasets, along with evaluation protocols and open-source tools to facilitate research and identify algorithm shortcomings.

Findings

01

Existing algorithms reveal significant deficiencies on new benchmarks.

02

Diverse datasets expose limitations of current offline RL methods.

03

Benchmark resources are publicly available for community use.

Abstract

The offline reinforcement learning (RL) setting (also known as full batch RL), where a policy is learned from a static dataset, is compelling as progress enables RL methods to take advantage of large, previously-collected datasets, much like how the rise of large datasets has fueled results in supervised learning. However, existing online RL benchmarks are not tailored towards the offline setting and existing offline RL benchmarks are restricted to data generated by partially-trained agents, making progress in offline RL difficult to measure. In this work, we introduce benchmarks specifically designed for the offline setting, guided by key properties of datasets relevant to real-world applications of offline RL. With a focus on dataset collection, examples of such properties include: datasets generated via hand-designed controllers and human demonstrators, multitask datasets where an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

Datasets for Data-Driven Reinforcement Learning· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Autonomous Vehicle Technology and Safety