A Workflow for Offline Model-Free Robotic Reinforcement Learning

Aviral Kumar; Anikait Singh; Stephen Tian; Chelsea Finn; Sergey Levine

arXiv:2109.10813·cs.LG·September 24, 2021·6 cites

A Workflow for Offline Model-Free Robotic Reinforcement Learning

Aviral Kumar, Anikait Singh, Stephen Tian, Chelsea Finn, Sergey Levine

PDF

Open Access 1 Repo

TL;DR

This paper presents a practical workflow for offline reinforcement learning in robotics, including metrics and guidelines to improve policy performance without online tuning, validated on simulated and real robotic tasks.

Contribution

It introduces a set of metrics and a systematic workflow for offline RL that guides design choices and hyperparameters without online evaluation, bridging the gap to real-world robotic applications.

Findings

01

Effective policies learned without online tuning

02

Workflow improves offline RL performance in robotic tasks

03

Validated on simulated and real robots with raw image inputs

Abstract

Offline reinforcement learning (RL) enables learning control policies by utilizing only prior experience, without any online interaction. This can allow robots to acquire generalizable skills from large and diverse datasets, without any costly or unsafe online data collection. Despite recent algorithmic advances in offline RL, applying these methods to real-world problems has proven challenging. Although offline RL methods can learn from prior data, there is no clear and well-understood process for making various design choices, from model architecture to algorithm hyperparameters, without actually evaluating the learned policies online. In this paper, our aim is to develop a practical workflow for using offline RL analogous to the relatively well-understood workflows for supervised learning problems. To this end, we devise a set of metrics and conditions that can be tracked over the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Asap7772/OfflineRlWorkflow
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Advanced Bandit Algorithms Research