Approximation Methods for Partially Observed Markov Decision Processes   (POMDPs)

Caleb M. Bowyer

arXiv:2108.13965·cs.LG·September 1, 2021

Approximation Methods for Partially Observed Markov Decision Processes (POMDPs)

Caleb M. Bowyer

PDF

Open Access

TL;DR

This survey reviews the origins, theory, and approximation methods for finite-state POMDPs, emphasizing their computational challenges and recent research directions in control under uncertainty.

Contribution

It provides a comprehensive overview of POMDPs, focusing on approximation techniques and recent research directions, with essential background on MDPs and HMMs.

Findings

01

Approximation methods are crucial for handling large state spaces in POMDPs.

02

Exact solutions are computationally intensive, necessitating efficient approximations.

03

Emerging research directions aim to improve scalability and solution quality.

Abstract

POMDPs are useful models for systems where the true underlying state is not known completely to an outside observer; the outside observer incompletely knows the true state of the system, and observes a noisy version of the true system state. When the number of system states is large in a POMDP that often necessitates the use of approximation methods to obtain near optimal solutions for control. This survey is centered around the origins, theory, and approximations of finite-state POMDPs. In order to understand POMDPs, it is required to have an understanding of finite-state Markov Decision Processes (MDPs) in \autoref{mdp} and Hidden Markov Models (HMMs) in \autoref{hmm}. For this background theory, I provide only essential details on MDPs and HMMs and leave longer expositions to textbook treatments before diving into the main topics of POMDPs. Once the required background is covered,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Bayesian Modeling and Causal Inference · Software Reliability and Analysis Research