BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds
Huayi Wang, Zirui Wang, Junli Ren, Qingwei Ben, Tao Huang, Weinan, Zhang, Jiangmiao Pang

TL;DR
BeamDojo is a reinforcement learning framework that enables humanoid robots to traverse complex terrains with sparse footholds by combining novel reward functions, a two-stage training process, and real-world LiDAR integration.
Contribution
The paper introduces BeamDojo, a new RL framework with a polygonal foot reward, double critic, and two-stage training for agile humanoid locomotion on sparse footholds.
Findings
Efficient learning in simulation for sparse foothold terrains.
Successful real-world deployment with precise foot placement.
High success rate under external disturbances.
Abstract
Traversing risky terrains with sparse footholds poses a significant challenge for humanoid robots, requiring precise foot placements and stable locomotion. Existing learning-based approaches often struggle on such complex terrains due to sparse foothold rewards and inefficient learning processes. To address these challenges, we introduce BeamDojo, a reinforcement learning (RL) framework designed for enabling agile humanoid locomotion on sparse footholds. BeamDojo begins by introducing a sampling-based foothold reward tailored for polygonal feet, along with a double critic to balancing the learning process between dense locomotion rewards and sparse foothold rewards. To encourage sufficient trial-and-error exploration, BeamDojo incorporates a two-stage RL approach: the first stage relaxes the terrain dynamics by training the humanoid on flat terrain while providing it with task-terrain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robotic Locomotion and Control · Context-Aware Activity Recognition Systems
