Walk the PLANC: Physics-Guided RL for Agile Humanoid Locomotion on Constrained Footholds
Min Dai, William D. Compton, Junheng Li, Lizhi Yang, Aaron D. Ames

TL;DR
This paper presents a physics-guided reinforcement learning framework for humanoid robots to navigate constrained terrains with high accuracy and agility, combining structured planning with data-driven adaptation.
Contribution
It introduces a novel framework integrating a reduced-order stepping planner with RL using CLF rewards, enhancing reliability and precision in humanoid locomotion on complex terrains.
Findings
Achieved hardware-validated stepping-stone locomotion
Significantly improved reliability over model-free RL baselines
Produced accurate and agile movements on constrained terrains
Abstract
Bipedal humanoid robots must precisely coordinate balance, timing, and contact decisions when locomoting on constrained footholds such as stepping stones, beams, and planks -- even minor errors can lead to catastrophic failure. Classical optimization and control pipelines handle these constraints well but depend on highly accurate mathematical representations of terrain geometry, making them prone to error when perception is noisy or incomplete. Meanwhile, reinforcement learning has shown strong resilience to disturbances and modeling errors, yet end-to-end policies rarely discover the precise foothold placement and step sequencing required for discontinuous terrain. These contrasting limitations motivate approaches that guide learning with physics-based structure rather than relying purely on reward shaping. In this work, we introduce a locomotion framework in which a reduced-order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Human Motion and Animation · Reinforcement Learning in Robotics
