APEX: Action Priors Enable Efficient Exploration for Robust Motion Tracking on Legged Robots
Shivam Sood, Laukik Nakhwa, Sun Ge, Yuhong Cao, Jin Cheng, Fatemah Zargarbashi, Taerim Yoon, Sungjoon Choi, Stelian Coros, and Guillaume Sartoretti

TL;DR
APEX introduces a novel reinforcement learning extension for legged robots that uses decaying action priors from demonstrations, enabling efficient, robust, and adaptable motion learning without reliance on reference data during deployment.
Contribution
The paper presents APEX, a plug-and-play method that integrates expert demonstrations into RL with decaying priors, improving exploration, sample efficiency, and generalization in legged robot locomotion.
Findings
Enhanced sample efficiency in learning locomotion
Robust transfer of styles across terrains and velocities
Reduced tuning and dependence on reference data
Abstract
Learning natural, animal-like locomotion from demonstrations has become a core paradigm in legged robotics. Despite the recent advancements in motion tracking, most existing methods demand extensive tuning and rely on reference data during deployment, limiting adaptability. We present APEX (Action Priors enable Efficient Exploration), a plug-and-play extension to state-of-the-art motion tracking algorithms that eliminates any dependence on reference data during deployment, improves sample efficiency, and reduces parameter tuning effort. APEX integrates expert demonstrations directly into reinforcement learning (RL) by incorporating decaying action priors, which initially bias exploration toward expert demonstrations but gradually allow the policy to explore independently. This is combined with a multi-critic framework that balances task performance with motion style. Moreover, APEX…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Zebrafish Biomedical Research Applications · Robot Manipulation and Learning
