LP Based Upper and Lower Bounds for Ces\`aro and Abel Limits of the   Optimal Values in Problems of Control of Stochastic Discrete Time Systems

Konstantin Avrachenkov; Vladimir Gaitsgory; and Lucas Gamertsfelder

arXiv:2010.15375·math.OC·March 9, 2022

LP Based Upper and Lower Bounds for Ces\`aro and Abel Limits of the Optimal Values in Problems of Control of Stochastic Discrete Time Systems

Konstantin Avrachenkov, Vladimir Gaitsgory, and Lucas Gamertsfelder

PDF

TL;DR

This paper investigates the asymptotic behavior of stochastic control problems in discrete time, showing that Cesàro and Abel limits of optimal values can be characterized via infinite-dimensional linear programming and its dual.

Contribution

It introduces a novel approach to evaluate Cesàro and Abel limits of optimal values using infinite-dimensional linear programming for stochastic control problems.

Findings

01

Cesàro and Abel limits are characterized by linear programming formulations.

02

The approach applies to Markov decision processes with time averaging and discounting.

03

Provides a new method for analyzing long-term behavior of stochastic control systems.

Abstract

In this paper, we study asymptotic properties of problems of control of stochastic discrete time systems (also known as Markov decision processes) with time averaging and time discounting optimality criteria, and we establish that the Ces\`aro and Abel limits of the optimal values in such problems can be evaluated with the help of a certain infinite-dimensional linear programming problem and its dual.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.