Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac, Petar Veličković, Ognjen Milinković, Pierre-Luc Bacon, Jian Tang, Mladen Nikolić

TL;DR
This paper introduces eXecuted Latent Value Iteration Networks (XLVINs), implicit planners that run value iteration in a high-dimensional latent space. XLVINs improve data efficiency and align closely with value iteration across classical control, navigation, and Atari environments.
Contribution
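We propose XLVINs, which perform implicit planning in a high-dimensional latent space. This alleviates two limitations of prior approaches, namely the assumption of a tabular environment and the algorithmic bottleneck caused by planning over scalar predictions, and it improves data efficiency in reinforcement learning tasks.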
Findings
XLVINs outperform prior implicit planners in low-data settings.
XLVINs closely align with traditional value iteration.
The method improves data efficiency across classical control, navigation, and Atari environments.
Abstract
Implicit planning has emerged as an elegant technique for combining learned models of the world with end-to-end model-free reinforcement learning. We study the class of implicit planners inspired by value iteration, an algorithm that is guaranteed to yield perfect policies in fully-specified tabular environments. We find that prior approaches either assume that the environment is provided in such a tabular form -- which is highly restrictive -- or infer "local neighbourhoods" of states to run value iteration over -- for which we discover an algorithmic bottleneck effect. This effect is caused by explicitly running the planning algorithm based on scalar predictions in every state, which can be harmful to data efficiency if such scalars are improperly predicted. We propose eXecuted Latent Value Iteration Networks (XLVINs), which alleviate the above limitations. Our method performs all…
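For readers unfamiliar with the algorithm the abstract builds on, the following is a minimal sketch of classical tabular value iteration, the setting in which it is guaranteed to yield perfect policies. It is not the XLVIN method itself. The function name `value_iteration`, the array layout of `P` and `R`, and the three-state chain environment are all illustrative assumptions and do not come from the paper.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iters=10_000):
    """Classical tabular value iteration (illustrative sketch).

    P: array of shape (A, S, S); P[a, s, t] is the probability of
       moving from state s to state t under action a (assumed layout).
    R: array of shape (S, A); R[s, a] is the expected immediate reward.
    Returns the state values V and a greedy policy.
    """
    num_states = R.shape[0]
    V = np.zeros(num_states)
    for _ in range(max_iters):
        # Bellman optimality backup:
        # Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Recompute Q from the final values to extract the greedy policy.
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    return V, Q.argmax(axis=1)


# Hypothetical 3-state chain: action 0 moves left, action 1 moves right.
# State 2 is an absorbing goal; entering it from state 1 yields reward 1.
P = np.zeros((2, 3, 3))
P[0, 0, 0] = P[0, 1, 0] = P[0, 2, 2] = 1.0  # left
P[1, 0, 1] = P[1, 1, 2] = P[1, 2, 2] = 1.0  # right
R = np.zeros((3, 2))
R[1, 1] = 1.0

V, policy = value_iteration(P, R, gamma=0.9)
print(V, policy)
```

Note that this sketch requires the full transition tensor `P` and reward table `R` as inputs. That is the fully-specified tabular assumption the abstract describes as highly restrictive.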
Taxonomy
Topics: Reinforcement Learning in Robotics · AI-based Problem Solving and Planning · Neural Networks and Applications
