Sparse tree search optimality guarantees in POMDPs with continuous   observation spaces

Michael H. Lim; Claire J. Tomlin; Zachary N. Sunberg

arXiv:1910.04332·cs.LG·June 6, 2023

Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

PDF

1 Repo

TL;DR

This paper provides a theoretical guarantee for a sampling-based algorithm in POMDPs with continuous observations, showing it can approximate optimal solutions with sufficient computation.

Contribution

It introduces and proves the effectiveness of the POWSS algorithm, offering formal guarantees for its accuracy and near-optimality in continuous observation POMDPs.

Findings

01

POWSS estimates Q-values accurately with high probability.

02

Increasing computational resources improves near-optimal performance.

03

Provides formal theoretical justification for observation likelihood weighting techniques.

Abstract

Partially observable Markov decision processes (POMDPs) with continuous state and observation spaces have powerful flexibility for representing real-world decision and control problems but are notoriously difficult to solve. Recent online sampling-based algorithms that use observation likelihood weighting have shown unprecedented effectiveness in domains with continuous observation spaces. However there has been no formal theoretical justification for this technique. This work offers such a justification, proving that a simplified algorithm, partially observable weighted sparse sampling (POWSS), will estimate Q-values accurately with high probability and can be made to perform arbitrarily near the optimal solution by increasing computational power.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JuliaPOMDP/SparseSampling.jl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.