Point-Based POMDP Algorithms: Improved Analysis and Implementation

Trey Smith; Reid Simmons

arXiv:1207.1412·cs.AI·July 9, 2012·93 cites

Point-Based POMDP Algorithms: Improved Analysis and Implementation

Trey Smith, Reid Simmons

PDF

Open Access

TL;DR

This paper introduces a new complexity bound for point-based POMDP algorithms that combines dimensionality and historical factors, along with improved implementation techniques for better efficiency.

Contribution

It presents a novel complexity bound using discounted reachability and discusses enhancements to heuristic search value iteration algorithms.

Findings

01

Derived a new complexity bound combining curse of dimensionality and history

02

Implemented tighter initial bounds and avoided linear programs

03

Enhanced efficiency through better use of sparsity

Abstract

Existing complexity bounds for point-based POMDP value iteration algorithms focus either on the curse of dimensionality or the curse of history. We derive a new bound that relies on both and uses the concept of discounted reachability; our conclusions may help guide future algorithm design. We also discuss recent improvements to our (point-based) heuristic search value iteration algorithm. Our new implementation calculates tighter initial bounds, avoids solving linear programs, and makes more effective use of sparsity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Complexity and Algorithms in Graphs · Optimization and Search Problems