Finding Approximate POMDP solutions Through Belief Compression
N. Roy, G. Gordon, S. Thrun

TL;DR
This paper introduces a belief compression method using exponential family PCA to approximate solutions for large-scale POMDPs by focusing on low-dimensional belief subspaces, enabling scalable policy computation.
Contribution
The paper presents a novel belief compression technique using exponential family PCA to efficiently approximate POMDP solutions in high-dimensional spaces.
Findings
Able to handle POMDPs much larger than traditional methods
Effective belief space dimensionality reduction demonstrated
Successful application to robot navigation tasks
Abstract
Standard value function approaches to finding policies for Partially Observable Markov Decision Processes (POMDPs) are generally considered to be intractable for large models. The intractability of these algorithms is to a large extent a consequence of computing an exact, optimal policy over the entire belief space. However, in real-world POMDP problems, computing the optimal policy for the full belief space is often unnecessary for good control even for problems with complicated policy classes. The beliefs experienced by the controller often lie near a structured, low-dimensional subspace embedded in the high-dimensional belief space. Finding a good approximation to the optimal value function for only this subspace can be much easier than computing the full value function. We introduce a new method for solving large-scale POMDPs by reducing the dimensionality of the belief space. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Sensor Networks and Detection Algorithms
