Finding Approximate POMDP solutions Through Belief Compression

N. Roy; G. Gordon; S. Thrun

arXiv:1107.0053·cs.AI·October 5, 2011·23 cites

Finding Approximate POMDP solutions Through Belief Compression

N. Roy, G. Gordon, S. Thrun

PDF

Open Access

TL;DR

This paper introduces a belief compression method using exponential family PCA to approximate solutions for large-scale POMDPs by focusing on low-dimensional belief subspaces, enabling scalable policy computation.

Contribution

The paper presents a novel belief compression technique using exponential family PCA to efficiently approximate POMDP solutions in high-dimensional spaces.

Findings

01

Able to handle POMDPs much larger than traditional methods

02

Effective belief space dimensionality reduction demonstrated

03

Successful application to robot navigation tasks

Abstract

Standard value function approaches to finding policies for Partially Observable Markov Decision Processes (POMDPs) are generally considered to be intractable for large models. The intractability of these algorithms is to a large extent a consequence of computing an exact, optimal policy over the entire belief space. However, in real-world POMDP problems, computing the optimal policy for the full belief space is often unnecessary for good control even for problems with complicated policy classes. The beliefs experienced by the controller often lie near a structured, low-dimensional subspace embedded in the high-dimensional belief space. Finding a good approximation to the optimal value function for only this subspace can be much easier than computing the full value function. We introduce a new method for solving large-scale POMDPs by reducing the dimensionality of the belief space. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Sensor Networks and Detection Algorithms