Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement   Learning

Nikita Rudin; David Hoeller; Philipp Reist; and Marco Hutter

arXiv:2109.11978·cs.RO·August 22, 2022

Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Nikita Rudin, David Hoeller, Philipp Reist, and Marco Hutter

PDF

5 Repos

TL;DR

This paper introduces a massively parallel deep reinforcement learning setup that enables rapid training of robotic walking policies within minutes, significantly accelerating the development process for legged robots.

Contribution

The authors develop a novel massively parallel training framework and a game-inspired curriculum, achieving rapid policy learning for quadrupedal robots on challenging terrains.

Findings

01

Training policies for flat terrain in under four minutes.

02

Training policies for uneven terrain in twenty minutes.

03

Successful transfer of policies from simulation to real robot.

Abstract

In this work, we present and study a training set-up that achieves fast policy generation for real-world robotic tasks by using massive parallelism on a single workstation GPU. We analyze and discuss the impact of different training algorithm components in the massively parallel regime on the final policy performance and training times. In addition, we present a novel game-inspired curriculum that is well suited for training with thousands of simulated robots in parallel. We evaluate the approach by training the quadrupedal robot ANYmal to walk on challenging terrain. The parallel approach allows training policies for flat terrain in under four minutes, and in twenty minutes for uneven terrain. This represents a speedup of multiple orders of magnitude compared to previous work. Finally, we transfer the policies to the real robot to validate the approach. We open-source our training code…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.