DecAP: Decaying Action Priors for Accelerated Imitation Learning of Torque-Based Legged Locomotion Policies
Shivam Sood, Ge Sun, Peizhuo Li, Guillaume Sartoretti

TL;DR
This paper introduces DecAP, a two-stage imitation learning framework that improves torque-based legged robot locomotion by using decaying action priors and imitation rewards, leading to more natural gaits and robustness.
Contribution
The paper proposes a novel two-stage framework with decaying action priors to enhance exploration in torque-based policies for legged robots, outperforming traditional imitation learning methods.
Findings
DecAP outperforms pure imitation learning in torque-based policies.
DecAP is robust to reward scaling from 0.1x to 10x.
Torque control improves robustness over position-based policies.
Abstract
Optimal Control for legged robots has gone through a paradigm shift from position-based to torque-based control, owing to the latter's compliant and robust nature. In parallel to this shift, the community has also turned to Deep Reinforcement Learning (DRL) as a promising approach to directly learn locomotion policies for complex real-life tasks. However, most end-to-end DRL approaches still operate in position space, mainly because learning in torque space is often sample-inefficient and does not consistently converge to natural gaits. To address these challenges, we propose a two-stage framework. In the first stage, we generate our own imitation data by training a position-based policy, eliminating the need for expert knowledge to design optimal controllers. The second stage incorporates decaying action priors, a novel method to enhance the exploration of torque-based policies aided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotic Locomotion and Control · Prosthetics and Rehabilitation Robotics
