DecAP: Decaying Action Priors for Accelerated Imitation Learning of   Torque-Based Legged Locomotion Policies

Shivam Sood; Ge Sun; Peizhuo Li; Guillaume Sartoretti

arXiv:2310.05714·cs.RO·September 4, 2024

DecAP: Decaying Action Priors for Accelerated Imitation Learning of Torque-Based Legged Locomotion Policies

Shivam Sood, Ge Sun, Peizhuo Li, Guillaume Sartoretti

PDF

Open Access 1 Repo

TL;DR

This paper introduces DecAP, a two-stage imitation learning framework that improves torque-based legged robot locomotion by using decaying action priors and imitation rewards, leading to more natural gaits and robustness.

Contribution

The paper proposes a novel two-stage framework with decaying action priors to enhance exploration in torque-based policies for legged robots, outperforming traditional imitation learning methods.

Findings

01

DecAP outperforms pure imitation learning in torque-based policies.

02

DecAP is robust to reward scaling from 0.1x to 10x.

03

Torque control improves robustness over position-based policies.

Abstract

Optimal Control for legged robots has gone through a paradigm shift from position-based to torque-based control, owing to the latter's compliant and robust nature. In parallel to this shift, the community has also turned to Deep Reinforcement Learning (DRL) as a promising approach to directly learn locomotion policies for complex real-life tasks. However, most end-to-end DRL approaches still operate in position space, mainly because learning in torque space is often sample-inefficient and does not consistently converge to natural gaits. To address these challenges, we propose a two-stage framework. In the first stage, we generate our own imitation data by training a position-based policy, eliminating the need for expert knowledge to design optimal controllers. The second stage incorporates decaying action priors, a novel method to enhance the exploration of torque-based policies aided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marmotlab/decaying_action_priors
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Locomotion and Control · Prosthetics and Rehabilitation Robotics