Average Cost Optimality of Partially Observed MDPS: Contraction of Non-linear Filters, Optimal Solutions and Approximations
Yunus Emre Demirci, Ali Devran Kara, Serdar Y\"uksel

TL;DR
This paper establishes explicit conditions for average cost optimality in partially observed Markov decision processes with compact state spaces, introducing a new contraction analysis and demonstrating several practical implications.
Contribution
It introduces a new contraction-based analysis for average cost optimality in POMDPs with compact states, providing explicit conditions and several novel implications.
Findings
New contraction analysis for non-linear filters
Robustness to incorrect priors demonstrated
Near optimality of quantized and finite-memory policies shown
Abstract
The average cost optimality is known to be a challenging problem for partially observable stochastic control, with few results available beyond the finite state, action, and measurement setup, for which somewhat restrictive conditions are available. In this paper, we present explicit and easily testable conditions for the existence of solutions to the average cost optimality equation where the state space is compact. In particular, we present a new contraction based analysis, which is new to the literature to our knowledge, building on recent regularity results for non-linear filters. Beyond establishing existence, we also present several implications of our analysis that are new to the literature: (i) robustness to incorrect priors (ii) near optimality of policies based on quantized approximations, (iii) near optimality of policies with finite memory, and (iv) convergence in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Stochastic processes and financial applications · Advanced Thermodynamics and Statistical Mechanics
