Reinforcement Learning based on MPC/MHE for Unmodeled and Partially   Observable Dynamics

Hossein Nejatbakhsh Esfahani; Arash Bahari Kordabad; Sebastien Gros

arXiv:2103.11871·eess.SY·March 23, 2021

Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics

Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

PDF

TL;DR

This paper introduces a reinforcement learning approach to tune MPC and MHE parameters for better control of systems with unmodeled and partially observable dynamics, improving closed-loop performance.

Contribution

It presents a novel RL-based method to jointly tune MPC and MHE parameters for POMDPs without relying on accurate models.

Findings

01

Enhanced control performance in POMDP systems

02

Effective parameter tuning via RL for unmodeled dynamics

03

Improved closed-loop control outcomes

Abstract

This paper proposes an observer-based framework for solving Partially Observable Markov Decision Processes (POMDPs) when an accurate model is not available. We first propose to use a Moving Horizon Estimation-Model Predictive Control (MHE-MPC) scheme in order to provide a policy for the POMDP problem, where the full state of the real process is not measured and necessarily known. We propose to parameterize both MPC and MHE formulations, where certain adjustable parameters are regarded for tuning the policy. In this paper, for the sake of tackling the unmodeled and partially observable dynamics, we leverage the Reinforcement Learning (RL) to tune the parameters of MPC and MHE schemes jointly, with the closed-loop performance of the policy as a goal rather than model fitting or the MHE performance. Illustrations show that the proposed approach can effectively increase the performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.