Unified Walking, Running, and Recovery for Humanoids via State-Dependent Adversarial Motion Priors
Yidan Lu, Yichao Zhong, Liu Zhao, Wanyue Li, Peng Lu

TL;DR
This paper introduces a unified reinforcement learning framework enabling a humanoid robot to perform walking, running, and fall recovery seamlessly without explicit mode switching, validated through real-world experiments.
Contribution
It extends Adversarial Motion Priors with a state-dependent gating mechanism for unified control of multiple locomotion modes on hardware.
Findings
Successful recovery from prone and supine falls.
Smooth transition between walking and running.
Single policy operates at 50 Hz without mode switching logic.
Abstract
We propose a unified reinforcement learning framework that enables a single policy to perform walking, running, and fall recovery on the Unitree G1 humanoid robot, validated on physical hardware without any explicit mode-switching command at deployment. The framework extends Adversarial Motion Priors (AMP) by replacing the conventional global reference distribution with a state-dependent gate that routes each training transition to one of two discriminators: a dedicated recovery discriminator and a velocity-conditioned locomotion discriminator that jointly covers walking and running. The gate is defined by a single fixed threshold on projected gravity: the recovery discriminator is activated when body tilt exceeds approximately from vertical (); otherwise the locomotion discriminator is used, with the normalized commanded velocity serving as a condition that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
