Concurrent Training of a Control Policy and a State Estimator for   Dynamic and Robust Legged Locomotion

Gwanghyeon Ji; Juhyeok Mun; Hyeongjun Kim; Jemin Hwangbo

arXiv:2202.05481·cs.RO·March 3, 2022

Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion

Gwanghyeon Ji, Juhyeok Mun, Hyeongjun Kim, Jemin Hwangbo

PDF

1 Repo

TL;DR

This paper introduces a concurrent training framework for a control policy and state estimator in legged robots, enabling robust and dynamic locomotion across diverse terrains with high speeds.

Contribution

It presents a novel framework for simultaneous training of control and estimation networks, successfully transferring to real robots for versatile terrain traversal.

Findings

01

Able to traverse diverse terrains including hills, slippery, and bumpy surfaces.

02

Achieves speeds up to 3.75 m/s on flat ground and 3.54 m/s on slippery surfaces.

03

Demonstrates effective real-world transfer of trained networks.

Abstract

In this paper, we propose a locomotion training framework where a control policy and a state estimator are trained concurrently. The framework consists of a policy network which outputs the desired joint positions and a state estimation network which outputs estimates of the robot's states such as the base linear velocity, foot height, and contact probability. We exploit a fast simulation environment to train the networks and the trained networks are transferred to the real robot. The trained policy and state estimator are capable of traversing diverse terrains such as a hill, slippery plate, and bumpy road. We also demonstrate that the learned policy can run at up to 3.75 m/s on normal flat ground and 3.54 m/s on a slippery plate with the coefficient of friction of 0.22.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

karlji1021/cheetah-software
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection