Deliberative Acting, Online Planning and Learning with Hierarchical Operational Models
Sunandita Patra, James Mason, Malik Ghallab, Dana Nau, Paolo Traverso

TL;DR
This paper introduces an integrated system combining acting and planning using hierarchical operational models, enabling online decision-making, learning, and improved robustness in AI agents.
Contribution
It proposes a unified operational model for both planning and acting, along with a Monte Carlo Tree Search planner and learning strategies, simplifying the integration of acting and planning.
Findings
UPOM converges to optimal methods in static domains.
Learning strategies improve acting efficiency.
Experimental results show enhanced robustness and performance.
Abstract
In AI research, synthesizing a plan of action has typically used descriptive models of the actions that abstractly specify what might happen as a result of an action, and are tailored for efficiently computing state transitions. However, executing the planned actions has needed operational models, in which rich computational control structures and closed-loop online decision-making are used to specify how to perform an action in a nondeterministic execution context, react to events and adapt to an unfolding situation. Deliberative actors, which integrate acting and planning, have typically needed to use both of these models together -- which causes problems when attempting to develop the different models, verify their consistency, and smoothly interleave acting and planning. As an alternative, we define and implement an integrated acting and planning system in which both planning and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Semantic Web and Ontologies
MethodsRegularized Autoencoders
