Region-Based Incremental Pruning for POMDPs
Zhengzhu Feng, Shlomo Zilberstein

TL;DR
This paper introduces a regional incremental pruning method for POMDPs that significantly improves scalability by dividing the belief space into smaller regions for independent pruning, enhancing performance in complex domains.
Contribution
The paper presents a novel regional approach to incremental pruning that reduces complexity and improves scalability in solving POMDPs compared to existing methods.
Findings
Significant performance improvements demonstrated experimentally.
Analytical evidence shows reduced complexity in belief space reasoning.
Enables solving larger and more complex POMDP domains.
Abstract
We present a major improvement to the incremental pruning algorithm for solving partially observable Markov decision processes. Our technique targets the cross-sum step of the dynamic programming (DP) update, a key source of complexity in POMDP algorithms. Instead of reasoning about the whole belief space when pruning the cross-sums, our algorithm divides the belief space into smaller regions and performs independent pruning in each region. We evaluate the benefits of the new technique both analytically and experimentally, and show that it produces very significant performance gains. The results contribute to the scalability of POMDP algorithms to domains that cannot be handled by the best existing techniques.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · AI-based Problem Solving and Planning
