Learning High-Level Policies for Model Predictive Control

Yunlong Song; Davide Scaramuzza

arXiv:2007.10284·cs.RO·May 11, 2021

Learning High-Level Policies for Model Predictive Control

Yunlong Song, Davide Scaramuzza

PDF

1 Repo

TL;DR

This paper introduces High-MPC, a deep neural network-based high-level policy that enhances Model Predictive Control by enabling adaptive decision-making in dynamic environments, demonstrated on a simulated quadrotor task.

Contribution

It formulates high-level decision variable selection as a probabilistic inference problem and proposes a self-supervised learning algorithm for online adaptation in MPC.

Findings

01

High-MPC improves control performance in complex tasks.

02

The method enables online hyperparameter adaptation.

03

It outperforms standard MPC in dynamic environments.

Abstract

The combination of policy search and deep neural networks holds the promise of automating a variety of decision-making tasks. Model Predictive Control (MPC) provides robust solutions to robot control tasks by making use of a dynamical model of the system and solving an optimization problem online over a short planning horizon. In this work, we leverage probabilistic decision-making approaches and the generalization capability of artificial neural networks to the powerful online optimization by learning a deep high-level policy for the MPC (High-MPC). Conditioning on robot's local observations, the trained neural network policy is capable of adaptively selecting high-level decision variables for the low-level MPC controller, which then generates optimal control commands for the robot. First, we formulate the search of high-level decision variables for MPC as a policy search problem,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uzh-rpg/high_mpc
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.