Convergence of Probability Measures and Markov Decision Models with   Incomplete Information

Eugene A. Feinberg; Pavlo O. Kasyanov; Michael Z. Zgurovsky

arXiv:1407.1029·math.PR·July 4, 2014

Convergence of Probability Measures and Markov Decision Models with Incomplete Information

Eugene A. Feinberg, Pavlo O. Kasyanov, Michael Z. Zgurovsky

PDF

Open Access

TL;DR

This paper analyzes different types of convergence of probability measures on metric spaces, providing criteria and conditions, and applies these results to control problems in Partially Observable Markov Decision Processes with incomplete information.

Contribution

It offers new criteria for convergence of probability measures and stochastic kernels, and applies these to control models with incomplete information.

Findings

01

Criteria for weak and setwise convergence established

02

Continuity conditions for stochastic kernels derived

03

Applications to control of Partially Observable Markov Decision Processes

Abstract

This paper deals with three major types of convergence of probability measures on metric spaces: weak convergence, setwise converges, and convergence in the total variation. First, it describes and compares necessary and sufficient conditions for these types of convergence, some of which are well-known, in terms of convergence of probabilities of open and closed sets and, for the probabilities on the real line, in terms of convergence of distribution functions. Second, it provides % convenient criteria for weak and setwise convergence of probability measures and continuity of stochastic kernels in terms of convergence of probabilities defined on the base of the topology generated by the metric. Third, it provides applications to control of Partially Observable Markov Decision Processes and, in particular, to Markov Decision Models with incomplete information.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Probability and Risk Models · Statistical Methods and Inference