Robustness to incorrect system models in stochastic control

Ali Devran Kara; Serdar Y\"uksel

arXiv:1803.06046·cs.SY·February 4, 2020

Robustness to incorrect system models in stochastic control

Ali Devran Kara, Serdar Y\"uksel

PDF

TL;DR

This paper investigates the robustness of stochastic control policies to model inaccuracies, establishing conditions under which optimal costs are continuous and providing bounds on performance loss due to model mismatch.

Contribution

It offers refined robustness results applicable under various convergence criteria, including weak and setwise, extending the understanding of control policy robustness in data-driven settings.

Findings

01

Optimal cost is robust under total variation convergence.

02

Continuity of optimal cost can be achieved under weak convergence with additional assumptions.

03

Provides convergence results and error bounds for model mismatch scenarios.

Abstract

In stochastic control applications, typically only an ideal model (controlled transition kernel) is assumed and the control design is based on the given model, raising the problem of performance loss due to the mismatch between the assumed model and the actual model. Toward this end, we study continuity properties of discrete-time stochastic control problems with respect to system models (i.e., controlled transition kernels) and robustness of optimal control policies designed for incorrect models applied to the true system. We study both fully observed and partially observed setups under an infinite horizon discounted expected cost criterion. We show that continuity and robustness cannot be established under weak and setwise convergences of transition kernels in general, but that the expected induced cost is robust under total variation. By imposing further assumptions on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.