Optimal Limited Contingency Planning

Nicolas Meuleau; David Smith

arXiv:1212.2502·cs.AI·December 12, 2012·30 cites

Optimal Limited Contingency Planning

Nicolas Meuleau, David Smith

PDF

Open Access

TL;DR

This paper introduces an efficient, optimal algorithm for limited contingency planning that avoids explicit enumeration, enabling the creation of simple, safe plans with a restricted number of decision branches.

Contribution

It presents the first optimal algorithm for k-contingency planning that models the problem as a POMDP and prunes the solution space without enumeration.

Findings

01

Algorithm successfully computes optimal plans with limited branches.

02

Prunes the solution space effectively, improving efficiency.

03

Applicable to simple test cases demonstrating feasibility.

Abstract

For a given problem, the optimal Markov policy can be considerred as a conditional or contingent plan containing a (potentially large) number of branches. Unfortunately, there are applications where it is desirable to strictly limit the number of decision points and branches in a plan. For example, it may be that plans must later undergo more detailed simulation to verify correctness and safety, or that they must be simple enough to be understood and analyzed by humans. As a result, it may be necessary to limit consideration to plans with only a small number of branches. This raises the question of how one goes about finding optimal plans containing only a limited number of branches. In this paper, we present an any-time algorithm for optimal k-contingency planning (OKP). It is the first optimal algorithm for limited contingency planning that is not an explicit enumeration of possible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Optimization and Search Problems · AI-based Problem Solving and Planning