Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja, Sehoon Ha, Aurick Zhou, Jie Tan, George Tucker,, Sergey Levine

TL;DR
This paper introduces a sample-efficient deep reinforcement learning algorithm based on maximum entropy RL, enabling a quadruped robot to learn stable walking gait directly in the real world within two hours without prior modeling or simulation.
Contribution
The authors develop a minimal-tuning, sample-efficient deep RL method that successfully learns real-world robotic locomotion from scratch, outperforming previous approaches in efficiency and robustness.
Findings
Learns stable walking gait in about two hours on real robot
Achieves state-of-the-art performance on simulated benchmarks
Policy is robust to environmental variations
Abstract
Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of complex controllers that can map sensory inputs directly to low-level actions. In the domain of robotic locomotion, deep RL could enable learning locomotion skills with minimal engineering and without an explicit model of the robot dynamics. Unfortunately, applying deep RL to real-world robotic tasks is exceptionally difficult, primarily due to poor sample complexity and sensitivity to hyperparameters. While hyperparameters can be easily tuned in simulated domains, tuning may be prohibitively expensive on physical systems, such as legged robots, that can be damaged through extensive trial-and-error learning. In this paper, we propose a sample-efficient deep RL algorithm based on maximum entropy RL that requires minimal per-task tuning and only a modest number of trials to learn neural network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Prosthetics and Rehabilitation Robotics · Reinforcement Learning in Robotics
