Economic MPC of Markov Decision Processes: Dissipativity in Undiscounted   Infinite-Horizon Optimal Control

S\'ebastien Gros; Mario Zanon

arXiv:2104.10997·eess.SY·July 25, 2022

Economic MPC of Markov Decision Processes: Dissipativity in Undiscounted Infinite-Horizon Optimal Control

S\'ebastien Gros, Mario Zanon

PDF

Open Access

TL;DR

This paper extends dissipativity theory in economic MPC to stochastic Markov Decision Processes, enabling stability analysis for a broader class of problems using nonlinear stage costs and probability measures.

Contribution

It introduces a generalized dissipativity framework for undiscounted infinite-horizon MDPs, addressing limitations of existing theories for stochastic systems.

Findings

01

Applied to stochastic LQ regulator with Gaussian noise

02

Explicit storage functional derived for the LQ case

03

Framework supports Lyapunov stability analysis in stochastic settings

Abstract

Economic Model Predictive Control (MPC) dissipativity theory is central to discussing the stability of policies resulting from minimizing economic stage costs. In its current form, the dissipativity theory for economic MPC applies to problems based on deterministic dynamics or to very specific classes of stochastic problems, and does not readily extend to generic Markov Decision Processes. In this paper, we clarify the core reason for this difficulty, and propose a generalization of the economic MPC dissipativity theory that circumvents it. This generalization focuses on undiscounted infinite-horizon problems and is based on nonlinear stage cost functionals, allowing one to discuss the Lyapunov asymptotic stability of policies for Markov Decision Processes in terms of the probability measures underlying their stochastic dynamics. This theory is illustrated for the stochastic Linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics