Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control
Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth,, Koushil Sreenath

TL;DR
This paper develops a versatile deep reinforcement learning-based control system for bipedal robots, enabling a wide range of dynamic skills with robustness and adaptability demonstrated in both simulation and real-world experiments.
Contribution
Introduces a novel dual-history RL architecture for general bipedal locomotion control, capable of handling diverse skills and environmental changes.
Findings
Outperforms existing methods across multiple skills
Successfully deployed on real robot Cassie
Achieves robust and adaptable locomotion in real-world tests
Abstract
This paper presents a comprehensive study on using deep reinforcement learning (RL) to create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single locomotion skill, we develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing. Our RL-based controller incorporates a novel dual-history architecture, utilizing both a long-term and short-term input/output (I/O) history of the robot. This control architecture, when trained through the proposed end-to-end RL approach, consistently outperforms other methods across a diverse range of skills in both simulation and the real world. The study also delves into the adaptivity and robustness introduced by the proposed RL system in developing locomotion controllers. We demonstrate that the proposed architecture can adapt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Muscle activation and electromyography studies · Robot Manipulation and Learning
MethodsSparse Evolutionary Training
