Simplified POMDP Planning with an Alternative Observation Space and   Formal Performance Guarantees

Da Kong; Vadim Indelman

arXiv:2410.07630·cs.RO·October 14, 2024

Simplified POMDP Planning with an Alternative Observation Space and Formal Performance Guarantees

Da Kong, Vadim Indelman

PDF

Open Access

TL;DR

This paper introduces a novel method for simplifying POMDP planning by using an alternative observation space and formal bounds, enabling faster decision-making with guaranteed performance in uncertain environments.

Contribution

It proposes a new framework that switches to a compact observation space with formal bounds, improving planning efficiency while maintaining solution quality.

Findings

01

Significant speedup in POMDP planning in simulations.

02

Formal bounds relate simplified and original POMDP solutions.

03

Effective in both exact and approximate solvers.

Abstract

Online planning under uncertainty in partially observable domains is an essential capability in robotics and AI. The partially observable Markov decision process (POMDP) is a mathematically principled framework for addressing decision-making problems in this challenging setting. However, finding an optimal solution for POMDPs is computationally expensive and is feasible only for small problems. In this work, we contribute a novel method to simplify POMDPs by switching to an alternative, more compact, observation space and simplified model to speedup planning with formal performance guarantees. We introduce the notion of belief tree topology, which encodes the levels and branches in the tree that use the original and alternative observation space and models. Each belief tree topology comes with its own policy space and planning performance. Our key contribution is to derive bounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Formal Methods in Verification · Robotic Path Planning Algorithms