Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh; Huihan Liu; Gaoyue Zhou; Albert Yu; Nicholas Rhinehart,; Sergey Levine

arXiv:2011.10024·cs.LG·November 20, 2020·27 cites

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart,, Sergey Levine

PDF

Open Access 1 Video

TL;DR

This paper introduces Parrot, a pre-training approach for reinforcement learning that leverages behavioral priors learned from diverse tasks to enable rapid adaptation to new tasks, especially in robotic manipulation with image inputs.

Contribution

It proposes a novel pre-training method for RL that captures complex behaviors from previous tasks to facilitate quick learning of new tasks without limiting exploration.

Findings

01

Outperforms prior methods significantly in robotic manipulation tasks

02

Effective in environments with image observations and sparse rewards

03

Enables rapid adaptation to new tasks using behavioral priors

Abstract

Reinforcement learning provides a general framework for flexible decision making and control, but requires extensive data collection for each new task that an agent needs to learn. In other machine learning fields, such as natural language processing or computer vision, pre-training on large, previously collected datasets to bootstrap learning for new tasks has emerged as a powerful paradigm to reduce data requirements when learning a new task. In this paper, we ask the following question: how can we enable similarly useful pre-training for RL agents? We propose a method for pre-training behavioral priors that can capture complex input-output relationships observed in successful trials from a wide range of previously seen tasks, and we show how this learned prior can be used for rapidly learning new tasks without impeding the RL agent's ability to try out novel behaviors. We demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning