Near Optimality of Quantized Policies in Stochastic Control Under Weak Continuity Conditions
Naci Saldi, Serdar Y\"uksel, Tam\'as Linder

TL;DR
This paper demonstrates that quantized policies can approximate optimal control policies in Markov decision processes under weak continuity conditions, broadening applicability in networked control and learning algorithms.
Contribution
It introduces a constructive discretization method for stationary policies in MDPs under weak continuity, enabling near-optimal approximations with less restrictive assumptions.
Findings
Quantized policies can approximate optimal policies arbitrarily closely.
Weak continuity assumptions are sufficient for discretization in MDPs.
Application to partially observed MDPs shows practical relevance.
Abstract
This paper studies the approximation of optimal control policies by quantized (discretized) policies for a very general class of Markov decision processes (MDPs). The problem is motivated by applications in networked control systems, computational methods for MDPs, and learning algorithms for MDPs. We consider the finite-action approximation of stationary policies for a discrete-time Markov decision process with discounted and average costs under a weak continuity assumption on the transition probability, which is a significant relaxation of conditions required in earlier literature. The discretization is constructive, and quantized policies are shown to approximate optimal deterministic stationary policies with arbitrary precision. The results are applied to the fully observed reduction of a partially observed Markov decision process, where weak continuity is a much more reasonable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
