# Stochastic Model Predictive Control: Output-Feedback, Duality and   Guaranteed Performance

**Authors:** Martin A Sehr, Robert R Bitmead

arXiv: 1706.00733 · 2020-05-01

## TL;DR

This paper introduces a new formulation for stochastic model predictive output feedback control that naturally incorporates output feedback, dual control, and guarantees near-infinite horizon optimality, despite computational challenges.

## Contribution

It translates stochastic optimal output feedback control into a receding horizon framework, extending to POMDPs and demonstrating applicability in healthcare decision making.

## Key findings

- Incorporates output feedback naturally
- Dual regulation and probing are inherent in the control
- Guarantees closed-loop performance relative to infinite-horizon optimal control

## Abstract

A new formulation of Stochastic Model Predictive Output Feedback Control is presented and analyzed as a translation of Stochastic Optimal Output Feedback Control into a receding horizon setting. This requires lifting the design into a framework involving propagation of the conditional state density, the information state, via the Bayesian Filter and solution of the Stochastic Dynamic Programming Equation for an optimal feedback policy, both stages of which are computationally challenging in the general, nonlinear setup. The upside is that the clearance of three bottleneck aspects of Model Predictive Control is connate to the optimality: output feedback is incorporated naturally; dual regulation and probing of the control signal is inherent; closed-loop performance relative to infinite-horizon optimal control is guaranteed. While the methods are numerically formidable, our aim is to develop an approach to Stochastic Model Predictive Control with guarantees and, from there, to seek a less onerous approximation. To this end, we discuss in particular the class of Partially Observable Markov Decision Processes, to which our results extend seamlessly, and demonstrate applicability with an example in healthcare decision making, where duality and associated optimality in the control signal are required for satisfactory closed-loop behavior.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.00733/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1706.00733/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1706.00733/full.md

---
Source: https://tomesphere.com/paper/1706.00733