Deep Model Predictive Optimization
Jacob Sacks, Rwik Rana, Kevin Huang, Alex Spitzer, Guanya Shi, Byron, Boots

TL;DR
Deep Model Predictive Optimization (DMPO) enhances control policies by learning the optimization process itself, leading to more robust and sample-efficient performance in complex robotics tasks like quadrotor flight.
Contribution
DMPO introduces a learned inner-loop optimization for MPC, improving robustness, efficiency, and adaptability over traditional MPC and model-free methods.
Findings
DMPO outperforms baseline MPC by up to 27% in performance.
DMPO achieves 19% better results than end-to-end MFRL policies.
DMPO requires 4.3 times less memory and fewer samples.
Abstract
A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly flexible and general but often results in brittle policies. In contrast, model predictive control (MPC) continually re-plans at each time step to remain robust to perturbations and model inaccuracies. However, despite its real-world successes, MPC often under-performs the optimal strategy. This is due to model quality, myopic behavior from short planning horizons, and approximations due to computational constraints. And even with a perfect model and enough compute, MPC can get stuck in bad local optima, depending heavily on the quality of the optimization algorithm. To this end, we propose Deep Model Predictive Optimization (DMPO), which learns the inner-loop of an MPC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMechanical Circulatory Support Devices · Cardiovascular Function and Risk Factors · Advanced Control Systems Optimization
