Optimal Limited Contingency Planning
Nicolas Meuleau, David Smith

TL;DR
This paper introduces an efficient, optimal algorithm for limited contingency planning that avoids explicit enumeration, enabling the creation of simple, safe plans with a restricted number of decision branches.
Contribution
It presents the first optimal algorithm for k-contingency planning that models the problem as a POMDP and prunes the solution space without enumeration.
Findings
Algorithm successfully computes optimal plans with limited branches.
Prunes the solution space effectively, improving efficiency.
Applicable to simple test cases demonstrating feasibility.
Abstract
For a given problem, the optimal Markov policy can be considerred as a conditional or contingent plan containing a (potentially large) number of branches. Unfortunately, there are applications where it is desirable to strictly limit the number of decision points and branches in a plan. For example, it may be that plans must later undergo more detailed simulation to verify correctness and safety, or that they must be simple enough to be understood and analyzed by humans. As a result, it may be necessary to limit consideration to plans with only a small number of branches. This raises the question of how one goes about finding optimal plans containing only a limited number of branches. In this paper, we present an any-time algorithm for optimal k-contingency planning (OKP). It is the first optimal algorithm for limited contingency planning that is not an explicit enumeration of possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Optimization and Search Problems · AI-based Problem Solving and Planning
