Jump-Start Reinforcement Learning with Self-Evolving Priors for Extreme Monopedal Locomotion
Ziang Zheng, Guojian Zhan, Shiqi Liu, Yao Lyu, Tao Zhang, Shengbo Eben Li

TL;DR
This paper introduces JumpER, a reinforcement learning framework that uses self-evolving priors and staged curriculum learning to enable quadruped robots to perform robust monopedal hopping across challenging terrains.
Contribution
The paper presents a novel RL training method that progressively refines policies through self-evolving priors, achieving stable learning without external experts or handcrafted rewards.
Findings
Enables quadruped robots to perform monopedal hopping on complex terrains.
Successfully handles wide gaps, irregular stairs, and variable stepping stones.
Achieves robust locomotion under extreme underactuation and terrain challenges.
Abstract
Reinforcement learning (RL) has shown great potential in enabling quadruped robots to perform agile locomotion. However, directly training policies to simultaneously handle dual extreme challenges, i.e., extreme underactuation and extreme terrains, as in monopedal hopping tasks, remains highly challenging due to unstable early-stage interactions and unreliable reward feedback. To address this, we propose JumpER (jump-start reinforcement learning via self-evolving priors), an RL training framework that structures policy learning into multiple stages of increasing complexity. By dynamically generating self-evolving priors through iterative bootstrapping of previously learned policies, JumpER progressively refines and enhances guidance, thereby stabilizing exploration and policy optimization without relying on external expert priors or handcrafted reward shaping. Specifically, when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robot Manipulation and Learning
