Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions
Muhammad Suhail Saleem, Rishi Veerapaneni, Maxim Likhachev

TL;DR
This paper introduces lazy heuristic search algorithms for POMDPs that defer costly belief transition computations, significantly reducing planning time in robotics applications while maintaining solution quality.
Contribution
The paper proposes Lazy RTDP-Bel and Lazy LAO* algorithms that postpone expensive belief updates using Q-value estimation, enhancing efficiency in robotics POMDPs.
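The summary does not spell out how Lazy RTDP-Bel or Lazy LAO* use their Q-value estimates, so the following is only a generic sketch of lazy evaluation, not the authors' algorithm. It assumes costs are minimized and that a cheap estimate never exceeds the exact Q-value (a lower bound); `lazy_best_action`, `estimate_q`, and `exact_q` are hypothetical names, with `exact_q` standing in for whatever requires the expensive belief transition.

```python
import heapq


def lazy_best_action(actions, estimate_q, exact_q):
    """Return (action, q) with the smallest exact Q-value (cost).

    Assumption: estimate_q(a) <= exact_q(a) for every action, so an action
    whose estimate is already no better than the best exact value found so
    far can be skipped without calling the expensive exact_q on it.
    """
    # Order actions by their cheap estimate; the index breaks ties so that
    # the actions themselves never need to be comparable.
    queue = [(estimate_q(a), i, a) for i, a in enumerate(actions)]
    heapq.heapify(queue)

    best_action, best_q = None, float("inf")
    while queue and queue[0][0] < best_q:
        _, _, action = heapq.heappop(queue)
        q = exact_q(action)  # the expensive step, done only when needed
        if q < best_q:
            best_action, best_q = action, q
    return best_action, best_q


# Illustrative numbers only (not from the paper).
estimates = {"push": 4.0, "look": 2.0, "grasp": 7.0}
exact = {"push": 5.0, "look": 3.0, "grasp": 9.0}
evaluated = []


def exact_q(action):
    evaluated.append(action)  # record which actions paid the full cost
    return exact[action]


result = lazy_best_action(list(estimates), estimates.get, exact_q)
```

In this toy run only one of the three actions is evaluated exactly, because the remaining estimates are already no better than its exact value. If the estimates were not lower bounds, the early stop could miss the true best action.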
Findings
Lazy planners outperform traditional methods in planning speed.
They maintain solution quality despite deferred belief computations.
Effective Q-value estimation techniques are discussed for practical use.
Abstract
Heuristic search solvers like RTDP-Bel and LAO* have proven effective for computing optimal and bounded sub-optimal solutions for Partially Observable Markov Decision Processes (POMDPs), which are typically formulated as belief MDPs. A belief represents a probability distribution over possible system states. Given a parent belief and an action, computing belief state transitions involves Bayesian updates that combine the transition and observation models of the POMDP to determine successor beliefs and their transition probabilities. However, there is a class of problems, specifically in robotics, where computing these transitions can be prohibitively expensive due to costly physics simulations, raycasting, or expensive collision checks required by the underlying transition and observation models, leading to long planning times. To address this challenge, we propose Lazy RTDP-Bel and…
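The Bayesian update mentioned in the abstract can be sketched for a small discrete POMDP as follows. This is a minimal illustration assuming tabular models for a single fixed action; the function name `belief_successors` and the nested-list layout of `trans` and `obs` are choices made for this example, not taken from the paper. In the robotics problems the abstract describes, the entries of these tables would come from simulation, raycasting, or collision checks, which is where the cost arises.

```python
def belief_successors(belief, trans, obs):
    """Enumerate successor beliefs of `belief` under one fixed action.

    belief[s]     : P(s)
    trans[s][s2]  : P(s2 | s, a)   (transition model for the action)
    obs[s2][z]    : P(z | s2, a)   (observation model for the action)

    Returns a list of (z, P(z | belief, a), successor_belief) tuples,
    skipping observations that have zero probability.
    """
    n_states = len(belief)
    n_next = len(trans[0])
    n_obs = len(obs[0])

    # Prediction step: push the belief through the transition model.
    predicted = [
        sum(belief[s] * trans[s][s2] for s in range(n_states))
        for s2 in range(n_next)
    ]

    successors = []
    for z in range(n_obs):
        # Correction step: weight each next state by the observation likelihood.
        unnormalized = [predicted[s2] * obs[s2][z] for s2 in range(n_next)]
        p_z = sum(unnormalized)  # probability of observing z
        if p_z > 0.0:
            successors.append((z, p_z, [u / p_z for u in unnormalized]))
    return successors


# Two-state example with illustrative numbers.
belief = [0.5, 0.5]
trans = [[1.0, 0.0], [0.0, 1.0]]   # the action leaves the state unchanged
obs = [[0.8, 0.2], [0.3, 0.7]]     # a noisy sensor
successors = belief_successors(belief, trans, obs)
```

Each returned tuple is one outgoing edge in the belief MDP: the observation, its probability, and the resulting belief.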
Taxonomy
Topics: Vehicle Routing Optimization Methods · Constraint Satisfaction and Optimization · Robotic Path Planning Algorithms
