Online algorithms for POMDPs with continuous state, action, and   observation spaces

Zachary Sunberg; Mykel Kochenderfer

arXiv:1709.06196·cs.AI·September 7, 2018·42 cites

Online algorithms for POMDPs with continuous state, action, and observation spaces

Zachary Sunberg, Mykel Kochenderfer

PDF

Open Access 4 Repos

TL;DR

This paper addresses the challenge of applying online POMDP algorithms to continuous spaces by proposing two new algorithms, POMCPOW and PFT-DPW, which use weighted particle filtering to improve policy quality.

Contribution

The paper introduces POMCPOW and PFT-DPW algorithms that overcome belief collapse issues in continuous POMDPs by employing weighted particle filtering techniques.

Findings

01

POMCPOW and PFT-DPW outperform previous methods in continuous POMDPs.

02

Weighted particle filtering prevents belief collapse in search trees.

03

Simulation results demonstrate improved success in complex continuous problems.

Abstract

Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representations in the search tree collapse to a single particle causing the algorithm to converge to a policy that is suboptimal regardless of the computation time. This paper proposes and evaluates two new algorithms, POMCPOW and PFT-DPW, that overcome this deficiency by using weighted particle filtering. Simulation results show that these modifications allow the algorithms to be successful where previous approaches fail.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and ELM · Domain Adaptation and Few-Shot Learning