Learning model-based planning from scratch
Razvan Pascanu, Yujia Li, Oriol Vinyals, Nicolas Heess, Lars Buesing,, Sebastien Racani\`ere, David Reichert, Th\'eophane Weber, Daan Wierstra,, Peter Battaglia

TL;DR
This paper introduces the Imagination-based Planner, a novel model-based decision-making agent that learns to construct, evaluate, and execute plans through flexible imagination steps, optimizing both rewards and computational costs.
Contribution
It presents the first agent capable of learning to plan by constructing and evaluating imagined actions, including complex imagination strategies, in continuous and discrete tasks.
Findings
Successfully solves a continuous control problem.
Learns elaborate planning strategies in maze tasks.
Optimizes planning and computational costs.
Abstract
Conventional wisdom holds that model-based planning is a powerful approach to sequential decision-making. It is often very challenging in practice, however, because while a model can be used to evaluate a plan, it does not prescribe how to construct a plan. Here we introduce the "Imagination-based Planner", the first model-based, sequential decision-making agent that can learn to construct, evaluate, and execute plans. Before any action, it can perform a variable number of imagination steps, which involve proposing an imagined action and evaluating it with its model-based imagination. All imagined actions and outcomes are aggregated, iteratively, into a "plan context" which conditions future real and imagined actions. The agent can even decide how to imagine: testing out alternative imagined actions, chaining sequences of actions together, or building a more complex "imagination tree"…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Learning model-based planning from scratch· youtube
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Robot Manipulation and Learning
