TL;DR
This paper introduces High-MPC, a deep neural network-based high-level policy that enhances Model Predictive Control by enabling adaptive decision-making in dynamic environments, demonstrated on a simulated quadrotor task.
Contribution
It formulates high-level decision variable selection as a probabilistic inference problem and proposes a self-supervised learning algorithm for online adaptation in MPC.
Findings
High-MPC improves control performance in complex tasks.
The method enables online hyperparameter adaptation.
It outperforms standard MPC in dynamic environments.
Abstract
The combination of policy search and deep neural networks holds the promise of automating a variety of decision-making tasks. Model Predictive Control (MPC) provides robust solutions to robot control tasks by making use of a dynamical model of the system and solving an optimization problem online over a short planning horizon. In this work, we leverage probabilistic decision-making approaches and the generalization capability of artificial neural networks to the powerful online optimization by learning a deep high-level policy for the MPC (High-MPC). Conditioning on robot's local observations, the trained neural network policy is capable of adaptively selecting high-level decision variables for the low-level MPC controller, which then generates optimal control commands for the robot. First, we formulate the search of high-level decision variables for MPC as a policy search problem,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
