Rao-Blackwellized POMDP Planning

Jiho Lee; Nisar R. Ahmed; Kyle H. Wray; Zachary N. Sunberg

arXiv:2409.16392·cs.AI·March 5, 2025

Rao-Blackwellized POMDP Planning

Jiho Lee, Nisar R. Ahmed, Kyle H. Wray, Zachary N. Sunberg

PDF

Open Access

TL;DR

This paper introduces Rao-Blackwellized POMDP solvers that improve belief update efficiency and planning accuracy in high-dimensional, uncertain environments compared to traditional particle filters.

Contribution

It proposes a novel Rao-Blackwellized approach for POMDP belief updates and planning, demonstrating improved performance over standard particle filters in simulated localization tasks.

Findings

01

RBPFs maintain accurate beliefs with fewer particles.

02

RBPF combined with quadrature enhances planning quality.

03

RBPF outperforms SIRPF in computational efficiency and accuracy.

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a structured framework for decision-making under uncertainty, but their application requires efficient belief updates. Sequential Importance Resampling Particle Filters (SIRPF), also known as Bootstrap Particle Filters, are commonly used as belief updaters in large approximate POMDP solvers, but they face challenges such as particle deprivation and high computational costs as the system's state dimension grows. To address these issues, this study introduces Rao-Blackwellized POMDP (RB-POMDP) approximate solvers and outlines generic methods to apply Rao-Blackwellization in both belief updates and online planning. We compare the performance of SIRPF and Rao-Blackwellized Particle Filters (RBPF) in a simulated localization problem where an agent navigates toward a target in a GPS-denied environment using POMCPOW and RB-POMCPOW…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Optimization and Mathematical Programming