Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention
Abhishek Gupta, Justin Yu, Tony Z. Zhao, Vikash Kumar, Aaron Rovinsky,, Kelvin Xu, Thomas Devlin, Sergey Levine

TL;DR
This paper introduces a multi-task reinforcement learning approach that enables robots to learn complex dexterous manipulation skills autonomously without human intervention or resets, significantly improving scalability and practicality.
Contribution
The work demonstrates that multi-task RL can inherently solve the reset problem, allowing for reset-free learning of complex manipulation tasks in real-world robots.
Findings
Successfully learned dexterous manipulation tasks without resets
Multi-task learning alleviates the need for explicit resets in RL
Effective in both simulation and real-world experiments
Abstract
Reinforcement Learning (RL) algorithms can in principle acquire complex robotic skills by learning from large amounts of data in the real world, collected via trial and error. However, most RL algorithms use a carefully engineered setup in order to collect data, requiring human supervision and intervention to provide episodic resets. This is particularly evident in challenging robotics problems, such as dexterous manipulation. To make data collection scalable, such applications require reset-free algorithms that are able to learn autonomously, without explicit instrumentation or human intervention. Most prior work in this area handles single-task learning. However, we might also want robots that can perform large repertoires of skills. At first, this would appear to only make the problem harder. However, the key observation we make in this work is that an appropriately chosen multi-task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
