Reset-Free Reinforcement Learning via Multi-Task Learning: Learning   Dexterous Manipulation Behaviors without Human Intervention

Abhishek Gupta; Justin Yu; Tony Z. Zhao; Vikash Kumar; Aaron Rovinsky,; Kelvin Xu; Thomas Devlin; Sergey Levine

arXiv:2104.11203·cs.LG·April 23, 2021

Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention

Abhishek Gupta, Justin Yu, Tony Z. Zhao, Vikash Kumar, Aaron Rovinsky,, Kelvin Xu, Thomas Devlin, Sergey Levine

PDF

TL;DR

This paper introduces a multi-task reinforcement learning approach that enables robots to learn complex dexterous manipulation skills autonomously without human intervention or resets, significantly improving scalability and practicality.

Contribution

The work demonstrates that multi-task RL can inherently solve the reset problem, allowing for reset-free learning of complex manipulation tasks in real-world robots.

Findings

01

Successfully learned dexterous manipulation tasks without resets

02

Multi-task learning alleviates the need for explicit resets in RL

03

Effective in both simulation and real-world experiments

Abstract

Reinforcement Learning (RL) algorithms can in principle acquire complex robotic skills by learning from large amounts of data in the real world, collected via trial and error. However, most RL algorithms use a carefully engineered setup in order to collect data, requiring human supervision and intervention to provide episodic resets. This is particularly evident in challenging robotics problems, such as dexterous manipulation. To make data collection scalable, such applications require reset-free algorithms that are able to learn autonomously, without explicit instrumentation or human intervention. Most prior work in this area handles single-task learning. However, we might also want robots that can perform large repertoires of skills. At first, this would appear to only make the problem harder. However, the key observation we make in this work is that an appropriately chosen multi-task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.