Deep Model Predictive Optimization

Jacob Sacks; Rwik Rana; Kevin Huang; Alex Spitzer; Guanya Shi; Byron; Boots

arXiv:2310.04590·cs.RO·October 1, 2024

Deep Model Predictive Optimization

Jacob Sacks, Rwik Rana, Kevin Huang, Alex Spitzer, Guanya Shi, Byron, Boots

PDF

Open Access 1 Repo

TL;DR

Deep Model Predictive Optimization (DMPO) enhances control policies by learning the optimization process itself, leading to more robust and sample-efficient performance in complex robotics tasks like quadrotor flight.

Contribution

DMPO introduces a learned inner-loop optimization for MPC, improving robustness, efficiency, and adaptability over traditional MPC and model-free methods.

Findings

01

DMPO outperforms baseline MPC by up to 27% in performance.

02

DMPO achieves 19% better results than end-to-end MFRL policies.

03

DMPO requires 4.3 times less memory and fewer samples.

Abstract

A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly flexible and general but often results in brittle policies. In contrast, model predictive control (MPC) continually re-plans at each time step to remain robust to perturbations and model inaccuracies. However, despite its real-world successes, MPC often under-performs the optimal strategy. This is due to model quality, myopic behavior from short planning horizons, and approximations due to computational constraints. And even with a perfect model and enough compute, MPC can get stuck in bad local optima, depending heavily on the quality of the optimization algorithm. To this end, we propose Deep Model Predictive Optimization (DMPO), which learns the inner-loop of an MPC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jisacks/dmpo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMechanical Circulatory Support Devices · Cardiovascular Function and Risk Factors · Advanced Control Systems Optimization