APEX: Action Priors Enable Efficient Exploration for Robust Motion Tracking on Legged Robots

Shivam Sood; Laukik Nakhwa; Sun Ge; Yuhong Cao; Jin Cheng; Fatemah Zargarbashi; Taerim Yoon; Sungjoon Choi; Stelian Coros; and Guillaume Sartoretti

arXiv:2511.09091·cs.RO·November 20, 2025

APEX: Action Priors Enable Efficient Exploration for Robust Motion Tracking on Legged Robots

Shivam Sood, Laukik Nakhwa, Sun Ge, Yuhong Cao, Jin Cheng, Fatemah Zargarbashi, Taerim Yoon, Sungjoon Choi, Stelian Coros, and Guillaume Sartoretti

PDF

Open Access

TL;DR

APEX introduces a novel reinforcement learning extension for legged robots that uses decaying action priors from demonstrations, enabling efficient, robust, and adaptable motion learning without reliance on reference data during deployment.

Contribution

The paper presents APEX, a plug-and-play method that integrates expert demonstrations into RL with decaying priors, improving exploration, sample efficiency, and generalization in legged robot locomotion.

Findings

01

Enhanced sample efficiency in learning locomotion

02

Robust transfer of styles across terrains and velocities

03

Reduced tuning and dependence on reference data

Abstract

Learning natural, animal-like locomotion from demonstrations has become a core paradigm in legged robotics. Despite the recent advancements in motion tracking, most existing methods demand extensive tuning and rely on reference data during deployment, limiting adaptability. We present APEX (Action Priors enable Efficient Exploration), a plug-and-play extension to state-of-the-art motion tracking algorithms that eliminates any dependence on reference data during deployment, improves sample efficiency, and reduces parameter tuning effort. APEX integrates expert demonstrations directly into reinforcement learning (RL) by incorporating decaying action priors, which initially bias exploration toward expert demonstrations but gradually allow the policy to explore independently. This is combined with a multi-critic framework that balances task performance with motion style. Moreover, APEX…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Zebrafish Biomedical Research Applications · Robot Manipulation and Learning