An On-Line POMDP Solver for Continuous Observation Spaces

Marcus Hoerger; Hanna Kurniawati

arXiv:2011.02076·cs.AI·November 5, 2020

An On-Line POMDP Solver for Continuous Observation Spaces

Marcus Hoerger, Hanna Kurniawati

PDF

TL;DR

This paper introduces LABECOP, an online POMDP solver for continuous observation spaces that combines Monte-Carlo-Tree-Search and particle filtering, avoiding discretization and observation limiting.

Contribution

The paper presents LABECOP, a novel online POMDP solver that handles continuous observations without discretization or observation limiting, improving planning efficiency and robustness.

Findings

01

LABECOP performs comparably or better than existing solvers.

02

It effectively handles continuous observation spaces.

03

Experiments demonstrate its efficiency across different problems.

Abstract

Planning under partial obervability is essential for autonomous robots. A principled way to address such planning problems is the Partially Observable Markov Decision Process (POMDP). Although solving POMDPs is computationally intractable, substantial advancements have been achieved in developing approximate POMDP solvers in the past two decades. However, computing robust solutions for problems with continuous observation spaces remains challenging. Most on-line solvers rely on discretising the observation space or artificially limiting the number of observations that are considered during planning to compute tractable policies. In this paper we propose a new on-line POMDP solver, called Lazy Belief Extraction for Continuous POMDPs (LABECOP), that combines methods from Monte-Carlo-Tree-Search and particle filtering to construct a policy reprentation which doesn't require discretised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.