Markov Decision Processes with Incomplete Information and Semi-Uniform Feller Transition Probabilities

Eugene A. Feinberg; Pavlo O. Kasyanov; Michael Z. Zgurovsky

arXiv:2108.09232 · math.OC · August 30, 2022 · SIAM J. Control. Optim.

Open Access

TL;DR

This paper introduces a class of partially observable stochastic control models with semi-uniform Feller transition probabilities. It establishes conditions under which optimal policies exist and value iteration converges, and it generalizes known results for POMDPs.

Contribution

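The paper defines and analyzes Markov Decision Processes with Incomplete Information and semi-uniform Feller transition probabilities. It shows that the classic reduction to completely observable MDPs with belief states preserves semi-uniform Feller continuity, which in turn gives the existence of optimal policies.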

Findings

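1. Optimal policies exist under mild assumptions on cost functions.

2. Optimality equations hold, and value iterations converge to optimal values.

3. For POMDPs, the results give new sufficient conditions, and generalize several known ones, for weak continuity of transition probabilities of the belief-state MDP.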
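
To make the value iteration finding concrete, here is a minimal sketch for a finite, discounted-cost MDP. This is a simplifying assumption for illustration only: the paper treats far more general state spaces, and the names `value_iteration`, `P`, `c`, and `alpha` are hypothetical, not taken from the paper.

```python
def value_iteration(P, c, alpha, n_iter):
    """Approximate optimal discounted costs for a finite MDP (illustrative sketch).

    P[a][s][s2] : probability of moving from state s to s2 under action a
    c[s][a]     : one-step cost of taking action a in state s
    alpha       : discount factor, assumed to lie in [0, 1)
    n_iter      : number of value iteration sweeps
    """
    n_states = len(c)
    n_actions = len(c[0])
    v = [0.0] * n_states  # start from the zero function
    for _ in range(n_iter):
        # Apply the optimality operator: minimize cost plus discounted expected value.
        v = [
            min(
                c[s][a] + alpha * sum(P[a][s][s2] * v[s2] for s2 in range(n_states))
                for a in range(n_actions)
            )
            for s in range(n_states)
        ]
    return v


# Toy example: one state, one action, cost 1 per step, discount 0.5.
# The discounted total cost is the geometric sum 1 / (1 - 0.5) = 2.
v_toy = value_iteration(P=[[[1.0]]], c=[[1.0]], alpha=0.5, n_iter=60)
```

In this toy case the iterates are the partial sums of a geometric series, so they increase toward the limit 2.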

Abstract

This paper deals with control of partially observable discrete-time stochastic systems. It introduces and studies Markov Decision Processes with Incomplete Information and with semi-uniform Feller transition probabilities. The important feature of these models is that their classic reduction to Completely Observable Markov Decision Processes with belief states preserves semi-uniform Feller continuity of transition probabilities. Under mild assumptions on cost functions, optimal policies exist, optimality equations hold, and value iterations converge to optimal values for these models. In particular, for Partially Observable Markov Decision Processes the results of this paper imply new and generalize several known sufficient conditions on transition and observation probabilities for weak continuity of transition probabilities for Markov Decision Processes with belief states, the…
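
The reduction to belief states mentioned in the abstract can be sketched for the simplest case of finitely many hidden states and observations. This is a hedged illustration under that finiteness assumption, not the paper's general construction; the function name `belief_update` and the arrays `P` and `Q` are hypothetical.

```python
def belief_update(belief, action, observation, P, Q):
    """Bayes update of a belief over finitely many hidden states (illustrative sketch).

    belief[s]        : prior probability of hidden state s
    P[action][s][s2] : probability of moving from s to s2 under `action`
    Q[action][s2][o] : probability of observing o when the new state is s2
    """
    n = len(belief)
    # Prediction step: push the prior through the transition probabilities.
    predicted = [
        sum(belief[s] * P[action][s][s2] for s in range(n)) for s2 in range(n)
    ]
    # Correction step: weight each new state by the observation likelihood.
    unnormalized = [predicted[s2] * Q[action][s2][observation] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief and action")
    return [u / total for u in unnormalized]


# Toy model (assumed numbers): two hidden states, one action, two observations.
P_toy = [[[0.9, 0.1], [0.2, 0.8]]]
Q_toy = [[[0.8, 0.2], [0.3, 0.7]]]
posterior = belief_update([0.5, 0.5], action=0, observation=0, P=P_toy, Q=Q_toy)
```

The posterior belief becomes the state of a completely observable MDP. Continuity properties of the map from (belief, action) to the distribution of the next belief are what results of this kind are about.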

Taxonomy

Topics: Bayesian Modeling and Causal Inference