Learning model-based planning from scratch

Razvan Pascanu; Yujia Li; Oriol Vinyals; Nicolas Heess; Lars Buesing,; Sebastien Racani\`ere; David Reichert; Th\'eophane Weber; Daan Wierstra,; Peter Battaglia

arXiv:1707.06170·cs.AI·July 20, 2017·78 cites

Learning model-based planning from scratch

Razvan Pascanu, Yujia Li, Oriol Vinyals, Nicolas Heess, Lars Buesing,, Sebastien Racani\`ere, David Reichert, Th\'eophane Weber, Daan Wierstra,, Peter Battaglia

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces the Imagination-based Planner, a novel model-based decision-making agent that learns to construct, evaluate, and execute plans through flexible imagination steps, optimizing both rewards and computational costs.

Contribution

It presents the first agent capable of learning to plan by constructing and evaluating imagined actions, including complex imagination strategies, in continuous and discrete tasks.

Findings

01

Successfully solves a continuous control problem.

02

Learns elaborate planning strategies in maze tasks.

03

Optimizes planning and computational costs.

Abstract

Conventional wisdom holds that model-based planning is a powerful approach to sequential decision-making. It is often very challenging in practice, however, because while a model can be used to evaluate a plan, it does not prescribe how to construct a plan. Here we introduce the "Imagination-based Planner", the first model-based, sequential decision-making agent that can learn to construct, evaluate, and execute plans. Before any action, it can perform a variable number of imagination steps, which involve proposing an imagined action and evaluating it with its model-based imagination. All imagined actions and outcomes are aggregated, iteratively, into a "plan context" which conditions future real and imagined actions. The agent can even decide how to imagine: testing out alternative imagined actions, chaining sequences of actions together, or building a more complex "imagination tree"…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Learning model-based planning from scratch· youtube

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Robot Manipulation and Learning