Integrating Planning and Execution in Stochastic Domains

Richard Dearden; Craig Boutilier

arXiv:1302.6799·cs.AI·February 28, 2013

Integrating Planning and Execution in Stochastic Domains

Richard Dearden, Craig Boutilier

PDF

Open Access

TL;DR

This paper presents a method for planning in stochastic, time-critical domains by interleaving search and execution, reducing computational costs while maintaining near-optimal performance.

Contribution

It introduces an approach that combines search and execution in Markov Decision Processes, using fixed-depth search and heuristics to efficiently find near-optimal policies.

Findings

01

Interleaving search and execution yields near-optimal policies.

02

Fixed-depth search with heuristics reduces computational costs.

03

The approach is effective in time-critical stochastic domains.

Abstract

We investigate planning in time-critical domains represented as Markov Decision Processes, showing that search based techniques can be a very powerful method for finding close to optimal plans. To reduce the computational cost of planning in these domains, we execute actions as we construct the plan, and sacrifice optimality by searching to a fixed depth and using a heuristic function to estimate the value of states. Although this paper concentrates on the search algorithm, we also discuss ways of constructing heuristic functions suitable for this approach. Our results show that by interleaving search and execution, close to optimal policies can be found without the computational requirements of other approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Bayesian Modeling and Causal Inference