Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Sergey Levine; Aviral Kumar; George Tucker; Justin Fu

arXiv:2005.01643·cs.LG·November 3, 2020·793 cites

TL;DR

This paper provides a comprehensive tutorial and review of offline reinforcement learning, discussing its potential, current challenges, recent solutions, and open problems to guide future research in the field.

Contribution

It offers an in-depth overview of offline reinforcement learning, highlighting key challenges, recent advancements, and open questions to advance understanding and development in the field.

Findings

01. Offline RL can leverage large datasets for decision making.

02. Current algorithms face limitations in policy optimization.

03. Recent methods show promise in addressing offline RL challenges.

Abstract

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get started on research on offline reinforcement learning algorithms: reinforcement learning algorithms that utilize previously collected data, without additional online data collection. Offline reinforcement learning algorithms hold tremendous promise for making it possible to turn large datasets into powerful decision making engines. Effective offline reinforcement learning methods would be able to extract policies with the maximum possible utility out of the available data, thereby allowing automation of a wide range of decision-making domains, from healthcare and education to robotics. However, the limitations of current algorithms make this difficult. We will aim to provide the reader with an understanding of these challenges, particularly in the context of modern deep reinforcement learning…

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Scheduling and Optimization Algorithms