Multi-Gait Learning for Humanoid Robots Using Reinforcement Learning with Selective Adversarial Motion Prior
Yuanye Wu, Keyi Wang, Linqi Ye, Boyang Xing

TL;DR
This paper introduces a multi-gait reinforcement learning approach for humanoid robots, employing a selective adversarial motion prior to improve convergence and stability across diverse locomotion skills.
Contribution
It proposes a novel selective AMP strategy that applies regularization selectively to stability-critical gaits, enhancing learning efficiency and performance.
Findings
Selective AMP accelerates convergence for stability gaits.
The approach achieves lower tracking error in simulation.
Zero-shot transfer enables deployment on real robots.
Abstract
Learning diverse locomotion skills for humanoid robots in a unified reinforcement learning framework remains challenging due to the conflicting requirements of stability and dynamic expressiveness across different gaits. We present a multi-gait learning approach that enables a humanoid robot to master five distinct gaits -- walking, goose-stepping, running, stair climbing, and jumping -- using a consistent policy structure, action space, and reward formulation. The key contribution is a selective Adversarial Motion Prior (AMP) strategy: AMP is applied to periodic, stability-critical gaits (walking, goose-stepping, stair climbing) where it accelerates convergence and suppresses erratic behavior, while being deliberately omitted for highly dynamic gaits (running, jumping) where its regularization would over-constrain the motion. Policies are trained via PPO with domain randomization in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
