Online POMDP Planning via Simplification

Ori Sztyglic; Vadim Indelman

arXiv:2105.05296·cs.AI·May 13, 2021·1 cites

Online POMDP Planning via Simplification

Ori Sztyglic, Vadim Indelman

PDF

Open Access

TL;DR

This paper introduces SITH-BSP, an algorithm for online POMDP planning that simplifies belief representations to achieve faster computation without losing optimality, validated through simulation with sampling-based bounds.

Contribution

The paper presents a novel belief simplification method with bounds for POMDPs, enabling guaranteed optimal solutions with significant speedup in online planning.

Findings

01

Significant computational speedup demonstrated in simulations.

02

Belief simplification with bounds maintains optimality.

03

Novel bounds for differential entropy with sampling-based beliefs.

Abstract

In this paper, we consider online planning in partially observable domains. Solving the corresponding POMDP problem is a very challenging task, particularly in an online setting. Our key contribution is a novel algorithmic approach, Simplified Information Theoretic Belief Space Planning (SITH-BSP), which aims to speed-up POMDP planning considering belief-dependent rewards, without compromising on the solution's accuracy. We do so by mathematically relating the simplified elements of the problem to the corresponding counterparts of the original problem. Specifically, we focus on belief simplification and use it to formulate bounds on the corresponding original belief-dependent rewards. These bounds in turn are used to perform branch pruning over the belief tree, in the process of calculating the optimal policy. We further introduce the notion of adaptive simplification, while re-using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Machine Learning and Algorithms · Robotic Path Planning Algorithms

MethodsPruning