What type of inference is planning?
Miguel L\'azaro-Gredilla, Li Yang Ku, Kevin P. Murphy, Dileep George

TL;DR
This paper unifies different planning inference types within a variational framework, showing their relation to entropy weighting, and develops an approximate planning algorithm for complex factored-state MDPs.
Contribution
It introduces a variational perspective that characterizes planning as a specific inference type, enabling the application of variational inference tricks to planning tasks.
Findings
Planning inference types correspond to different entropy weightings in variational problems.
The proposed loopy belief propagation analogue enables approximate planning in large factored-state MDPs.
Different inference types are suitable depending on environment stochasticity levels.
Abstract
Multiple types of inference are available for probabilistic graphical models, e.g., marginal, maximum-a-posteriori, and even marginal maximum-a-posteriori. Which one do researchers mean when they talk about "planning as inference"? There is no consistency in the literature, different types are used, and their ability to do planning is further entangled with specific approximations or additional constraints. In this work we use the variational framework to show that, just like all commonly used types of inference correspond to different weightings of the entropy terms in the variational problem, planning corresponds exactly to a different set of weights. This means that all the tricks of variational inference are readily applicable to planning. We develop an analogue of loopy belief propagation that allows us to perform approximate planning in factored-state Markov decisions processes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComplex Systems and Decision Making
MethodsSparse Evolutionary Training · Variational Inference
