Zero-Shot Retargeting of Learned Quadruped Locomotion Policies Using   Hybrid Kinodynamic Model Predictive Control

He Li; Tingnan Zhang; Wenhao Yu; and Patrick M. Wensing

arXiv:2209.14123·cs.RO·September 29, 2022

Zero-Shot Retargeting of Learned Quadruped Locomotion Policies Using Hybrid Kinodynamic Model Predictive Control

He Li, Tingnan Zhang, Wenhao Yu, and Patrick M. Wensing

PDF

Open Access

TL;DR

This paper introduces a hybrid kinodynamic MPC framework that enables zero-shot transfer of learned quadruped locomotion policies across different robot platforms, reducing the need for retraining.

Contribution

The work presents a novel planning-and-control pipeline combining RL and MPC with a hybrid kinodynamic model for effective policy retargeting across diverse quadruped robots.

Findings

01

Successful transfer of policies from A1 and Laikago to MIT Mini Cheetah

02

No policy re-tuning required during transfer

03

Robust and stable locomotion demonstrated on hardware

Abstract

Reinforcement Learning (RL) has witnessed great strides for quadruped locomotion, with continued progress in the reliable sim-to-real transfer of policies. However, it remains a challenge to reuse a policy on another robot, which could save time for retraining. In this work, we present a framework for zero-shot policy retargeting wherein diverse motor skills can be transferred between robots of different shapes and sizes. The new framework centers on a planning-and-control pipeline that systematically integrates RL and Model Predictive Control (MPC). The planning stage employs RL to generate a dynamically plausible trajectory as well as the contact schedule, avoiding the combinatorial complexity of contact sequence optimization. This information is then used to seed the MPC to stabilize and robustify the policy roll-out via a new Hybrid Kinodynamic (HKD) model that implicitly optimizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Prosthetics and Rehabilitation Robotics · Reinforcement Learning in Robotics