Asymptotic Optimality of Finite Approximations to Markov Decision   Processes with Borel Spaces

Naci Saldi; Serdar Y\"uksel; Tam\'as Linder

arXiv:1503.02244·math.OC·September 23, 2016

Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces

Naci Saldi, Serdar Y\"uksel, Tam\'as Linder

PDF

TL;DR

This paper demonstrates that finite-state approximations of Borel-space Markov decision processes can achieve near-optimal policies with explicit convergence rates, enabling practical computation of solutions for complex MDPs.

Contribution

It establishes the asymptotic optimality and convergence rates of finite approximations for Borel-space MDPs, including explicit bounds and conditions for both discounted and average costs.

Findings

01

Finite-state approximations can approximate optimal policies arbitrarily closely.

02

Explicit convergence rate bounds are derived for compact-state MDPs.

03

Action space discretization enables the use of standard algorithms for near-optimal policy computation.

Abstract

Calculating optimal policies is known to be computationally difficult for Markov decision processes (MDPs) with Borel state and action spaces. This paper studies finite-state approximations of discrete time Markov decision processes with Borel state and action spaces, for both discounted and average costs criteria. The stationary policies thus obtained are shown to approximate the optimal stationary policy with arbitrary precision under quite general conditions for discounted cost and more restrictive conditions for average cost. For compact-state MDPs, we obtain explicit rate of convergence bounds quantifying how the approximation improves as the size of the approximating finite state space increases. Using information theoretic arguments, the order optimality of the obtained convergence rates is established for a large class of problems. We also show that, as a pre-processing step the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.