Learning to Walk via Deep Reinforcement Learning

Tuomas Haarnoja; Sehoon Ha; Aurick Zhou; Jie Tan; George Tucker,; Sergey Levine

arXiv:1812.11103·cs.LG·June 20, 2019·42 cites

Learning to Walk via Deep Reinforcement Learning

Tuomas Haarnoja, Sehoon Ha, Aurick Zhou, Jie Tan, George Tucker,, Sergey Levine

PDF

Open Access

TL;DR

This paper introduces a sample-efficient deep reinforcement learning algorithm based on maximum entropy RL, enabling a quadruped robot to learn stable walking gait directly in the real world within two hours without prior modeling or simulation.

Contribution

The authors develop a minimal-tuning, sample-efficient deep RL method that successfully learns real-world robotic locomotion from scratch, outperforming previous approaches in efficiency and robustness.

Findings

01

Learns stable walking gait in about two hours on real robot

02

Achieves state-of-the-art performance on simulated benchmarks

03

Policy is robust to environmental variations

Abstract

Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of complex controllers that can map sensory inputs directly to low-level actions. In the domain of robotic locomotion, deep RL could enable learning locomotion skills with minimal engineering and without an explicit model of the robot dynamics. Unfortunately, applying deep RL to real-world robotic tasks is exceptionally difficult, primarily due to poor sample complexity and sensitivity to hyperparameters. While hyperparameters can be easily tuned in simulated domains, tuning may be prohibitively expensive on physical systems, such as legged robots, that can be damaged through extensive trial-and-error learning. In this paper, we propose a sample-efficient deep RL algorithm based on maximum entropy RL that requires minimal per-task tuning and only a modest number of trials to learn neural network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Prosthetics and Rehabilitation Robotics · Reinforcement Learning in Robotics