MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale

Dmitry Kalashnikov; Jacob Varley; Yevgen Chebotar; Benjamin Swanson,; Rico Jonschkowski; Chelsea Finn; Sergey Levine; Karol Hausman

arXiv:2104.08212·cs.RO·April 29, 2021·35 cites

MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale

Dmitry Kalashnikov, Jacob Varley, Yevgen Chebotar, Benjamin Swanson,, Rico Jonschkowski, Chelsea Finn, Sergey Levine, Karol Hausman

PDF

Open Access

TL;DR

MT-Opt introduces a scalable multi-task reinforcement learning framework enabling a team of robots to learn, share, and generalize a diverse set of skills simultaneously, improving efficiency and adaptability in real-world tasks.

Contribution

The paper presents a novel scalable multi-task reinforcement learning system, MT-Opt, that allows continuous learning and sharing of skills across multiple robots and tasks.

Findings

01

Successfully learned 12 real-world tasks with 7 robots.

02

Demonstrated generalization to new, structurally similar tasks.

03

Enabled faster acquisition of new tasks by leveraging past experience.

Abstract

General-purpose robotic systems must master a large repertoire of diverse skills to be useful in a range of daily tasks. While reinforcement learning provides a powerful framework for acquiring individual behaviors, the time needed to acquire each skill makes the prospect of a generalist robot trained with RL daunting. In this paper, we study how a large-scale collective robotic learning system can acquire a repertoire of behaviors simultaneously, sharing exploration, experience, and representations across tasks. In this framework new tasks can be continuously instantiated from previously learned tasks improving overall performance and capabilities of the system. To instantiate this system, we develop a scalable and intuitive framework for specifying new tasks through user-provided examples of desired outcomes, devise a multi-robot collective learning system for data collection that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Modular Robots and Swarm Intelligence