MPC-Net: A First Principles Guided Policy Search

Jan Carius; Farbod Farshidian; Marco Hutter

arXiv:1909.05197·cs.RO·February 18, 2020

MPC-Net: A First Principles Guided Policy Search

Jan Carius, Farbod Farshidian, Marco Hutter

PDF

1 Repo

TL;DR

MPC-Net introduces a novel imitation learning method guided by optimal control principles, enabling efficient learning of control policies that satisfy constraints and adapt to multimodal behaviors in robotic systems.

Contribution

The paper proposes a control policy learning approach using a loss function based on the control Hamiltonian, directly encoding optimality and constraints, with a mixture-of-expert neural network for quadrupedal robot control.

Findings

01

Successfully stabilizes multiple gaits on a real robot

02

Requires less than 10 minutes of demonstration data

03

Achieves improved constraint satisfaction

Abstract

We present an Imitation Learning approach for the control of dynamical systems with a known model. Our policy search method is guided by solutions from MPC. Typical policy search methods of this kind minimize a distance metric between the guiding demonstrations and the learned policy. Our loss function, however, corresponds to the minimization of the control Hamiltonian, which derives from the principle of optimality. Therefore, our algorithm directly attempts to solve the optimality conditions with a parameterized class of control laws. Additionally, the proposed loss function explicitly encodes the constraints of the optimal control problem and we provide numerical evidence that its minimization achieves improved constraint satisfaction. We train a mixture-of-expert neural network architecture for controlling a quadrupedal robot and show that this policy structure is well suited for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leggedrobotics/MPC-Net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.