Efficient Constraint Generation for Stochastic Shortest Path Problems
Johannes Schmalz, Felipe Trevizan

TL;DR
This paper introduces CG-iLAO*, a new algorithm that significantly reduces the number of actions considered in stochastic shortest path problems by using a linear programming-based constraint generation technique, leading to faster solutions.
Contribution
It presents a novel constraint generation approach for heuristic search in SSPs, enabling the algorithm to consider fewer actions and solve problems more efficiently.
Findings
CG-iLAO* considers only 40% of actions compared to iLAO* on many problems.
It computes 3.5 times fewer costs-to-go for actions than iLAO* and LRTDP.
CG-iLAO* solves problems on average 2.8 to 3.7 times faster.
Abstract
Stochastic Shortest Path problems (SSPs) are traditionally solved by computing each state's cost-to-go by applying Bellman backups. A Bellman backup updates a state's cost-to-go by iterating through every applicable action, computing the cost-to-go after applying each one, and selecting a minimal action's cost-to-go. State-of-the-art algorithms use heuristic functions; these give an initial estimate of costs-to-go, and lets the algorithm apply Bellman backups only to promising states, determined by low estimated costs-to-go. However, each Bellman backup still considers all applicable actions, even if the heuristic tells us that some of these actions are too expensive, with the effect that such algorithms waste time on unhelpful actions. To address this gap we present a technique that uses the heuristic to avoid expensive actions, by reframing heuristic search in terms of linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
