# Data-driven Economic NMPC using Reinforcement Learning

**Authors:** S\'ebastien Gros, Mario Zanon

arXiv: 1904.04152 · 2020-09-18

## TL;DR

This paper demonstrates that Economic NMPC can be tuned to achieve optimal control policies even with inaccurate models, bridging the gap between data-driven RL and model-based control for complex systems.

## Contribution

It introduces a method to tune ENMPC to match the optimal policy of the real system despite model inaccuracies, including stochastic dynamics, and connects this to dissipativity theory.

## Key findings

- ENMPC can be used as a function approximator in RL.
- The tuning method works with stochastic system dynamics.
- Results validated on linear and nonlinear control examples.

## Abstract

Reinforcement Learning (RL) is a powerful tool to perform data-driven optimal control without relying on a model of the system. However, RL struggles to provide hard guarantees on the behavior of the resulting control scheme. In contrast, Nonlinear Model Predictive Control (NMPC) and Economic NMPC (ENMPC) are standard tools for the closed-loop optimal control of complex systems with constraints and limitations, and benefit from a rich theory to assess their closed-loop behavior. Unfortunately, the performance of (E)NMPC hinges on the quality of the model underlying the control scheme. In this paper, we show that an (E)NMPC scheme can be tuned to deliver the optimal policy of the real system even when using a wrong model. This result also holds for real systems having stochastic dynamics. This entails that ENMPC can be used as a new type of function approximator within RL. Furthermore, we investigate our results in the context of ENMPC and formally connect them to the concept of dissipativity, which is central for the ENMPC stability. Finally, we detail how these results can be used to deploy classic RL tools for tuning (E)NMPC schemes. We apply these tools on both a classical linear MPC setting and a standard nonlinear example from the ENMPC literature.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.04152/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1904.04152/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1904.04152/full.md

---
Source: https://tomesphere.com/paper/1904.04152