COG: Connecting New Skills to Past Experience with Offline Reinforcement   Learning

Avi Singh; Albert Yu; Jonathan Yang; Jesse Zhang; Aviral Kumar; Sergey; Levine

arXiv:2010.14500·cs.LG·October 28, 2020·38 cites

COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning

Avi Singh, Albert Yu, Jonathan Yang, Jesse Zhang, Aviral Kumar, Sergey, Levine

PDF

Open Access 1 Repo

TL;DR

This paper introduces COG, a method that leverages prior offline data to extend and generalize robotic skills through dynamic programming, enabling the composition of multiple behaviors for new tasks in simulation and real-world settings.

Contribution

The paper presents a novel approach to reuse prior offline data for skill extension via dynamic programming, without requiring explicit skill hierarchies or decompositions.

Findings

01

Effective chaining of multiple skills in new tasks

02

Successful transfer from simulation to real robots

03

Improved policy performance using prior data

Abstract

Reinforcement learning has been applied to a wide variety of robotics problems, but most of such applications involve collecting data from scratch for each new task. Since the amount of robot data we can collect for any single task is limited by time and cost considerations, the learned behavior is typically narrow: the policy can only execute the task in a handful of scenarios that it was trained on. What if there was a way to incorporate a large amount of prior data, either from previously solved tasks or from unsupervised or undirected environment interaction, to extend and generalize learned behaviors? While most prior work on extending robotic skills using pre-collected data focuses on building explicit hierarchies or skill decompositions, we show in this paper that we can reuse prior data to extend new skills simply through dynamic programming. We show that even when the prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

avisingh599/cog
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Machine Learning and Algorithms