Reinforcement Learning for Legged Robots: Motion Imitation from   Model-Based Optimal Control

AJ Miller; Shamel Fahmi; Matthew Chignoli; and Sangbae Kim

arXiv:2305.10989·cs.RO·May 19, 2023·2 cites

Reinforcement Learning for Legged Robots: Motion Imitation from Model-Based Optimal Control

AJ Miller, Shamel Fahmi, Matthew Chignoli, and Sangbae Kim

PDF

Open Access

TL;DR

This paper introduces MIMOC, a reinforcement learning controller for legged robots that learns agile locomotion by imitating dynamically consistent, model-based optimal control trajectories, reducing the need for fine-tuning and improving robustness.

Contribution

MIMOC is a novel RL approach that imitates reference trajectories including torque references, enhancing robustness and reducing fine-tuning compared to prior imitation methods.

Findings

01

MIMOC outperforms traditional model-based controllers on challenging terrains.

02

Imitating torque references improves policy performance.

03

MIMOC demonstrates successful real-world deployment on Mini-Cheetah.

Abstract

We propose MIMOC: Motion Imitation from Model-Based Optimal Control. MIMOC is a Reinforcement Learning (RL) controller that learns agile locomotion by imitating reference trajectories from model-based optimal control. MIMOC mitigates challenges faced by other motion imitation RL approaches because the references are dynamically consistent, require no motion retargeting, and include torque references. Hence, MIMOC does not require fine-tuning. MIMOC is also less sensitive to modeling and state estimation inaccuracies than model-based controllers. We validate MIMOC on the Mini-Cheetah in outdoor environments over a wide variety of challenging terrain, and on the MIT Humanoid in simulation. We show cases where MIMOC outperforms model-based optimal controllers, and show that imitating torque references improves the policy's performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Prosthetics and Rehabilitation Robotics