Generalized Value Iteration Networks: Life Beyond Lattices

Sufeng Niu; Siheng Chen; Hanyu Guo; Colin Targonski; Melissa C. Smith; Jelena Kovačević

arXiv:1706.02416 · cs.LG · October 27, 2017 · 22 cites

Open Access · 1 Repo

TL;DR

This paper presents GVIN, a neural network planning module that generalizes value iteration to irregular graphs using novel graph convolution kernels, improving planning on diverse and unseen graph structures.

Contribution

Introduces GVIN, a planning module built on three novel differentiable graph convolution kernels, together with episodic Q-learning, a variant of n-step Q-learning that stabilizes training for networks containing a planning module.

Findings

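01. GVIN outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image) on irregular graphs and real-world street networks.

02. The embedding-based kernel achieves the best performance among the three proposed kernels.

03. GVIN generalizes well to unseen graphs, including graphs of larger scale.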

Abstract

In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. We further propose episodic Q-learning, an improvement upon traditional n-step Q-learning that stabilizes training for networks that contain a planning module. Lastly, we evaluate GVIN on planning problems in 2D mazes, irregular graphs, and real-world street networks, showing that GVIN generalizes well for both arbitrary graphs and unseen graphs of larger scale and outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image).
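For readers unfamiliar with the algorithm that GVIN emulates, the following is a minimal sketch of classical value iteration on a small irregular graph. It is not the authors' implementation: the edge costs are hand-written rather than produced by a learned graph convolution kernel, and the names `graph_value_iteration`, `greedy_path`, the toy graph, and the discount `gamma` are all assumptions made for illustration. The sketch only shows the fixed-point update that a differentiable planning module would approximate.

```python
from typing import Dict, List, Tuple

# Hypothetical graph type: node -> list of (neighbor, edge_cost) pairs.
Graph = Dict[int, List[Tuple[int, float]]]


def graph_value_iteration(graph: Graph, goal: int,
                          gamma: float = 0.95,
                          num_iters: int = 50) -> Dict[int, float]:
    """Classical value iteration; the reward for an edge is its negative cost."""
    values = {node: 0.0 for node in graph}
    for _ in range(num_iters):
        new_values = {}
        for node, edges in graph.items():
            if node == goal:
                new_values[node] = 0.0  # the goal is absorbing
                continue
            # One Q-value per outgoing edge, then a max over edges.
            q_values = [-cost + gamma * values[nbr] for nbr, cost in edges]
            new_values[node] = max(q_values) if q_values else float("-inf")
        values = new_values
    return values


def greedy_path(graph: Graph, values: Dict[int, float], start: int,
                goal: int, gamma: float = 0.95,
                max_steps: int = 100) -> List[int]:
    """Follow the edge with the highest Q-value until the goal is reached."""
    path = [start]
    node = start
    while node != goal and len(path) <= max_steps:
        edges = graph[node]
        if not edges:
            break
        node = max(edges, key=lambda e: -e[1] + gamma * values[e[0]])[0]
        path.append(node)
    return path


# Toy undirected graph (assumed for illustration): the direct edge 3-4 is
# expensive, so the cheapest route from node 3 to the goal goes around it.
graph: Graph = {
    0: [(1, 1.0), (2, 1.0)],
    1: [(0, 1.0), (4, 1.0)],
    2: [(0, 1.0), (3, 1.0)],
    3: [(2, 1.0), (4, 5.0)],
    4: [(1, 1.0), (3, 5.0)],
}
values = graph_value_iteration(graph, goal=4)
path = greedy_path(graph, values, start=0, goal=4)
print(path)  # [0, 1, 4]
```

In this sketch the per-edge costs are fixed inputs. As the abstract describes it, GVIN instead uses learned, differentiable graph convolution kernels in this role, so that the planning computation can be trained end to end.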

Code & Models

Repositories

sufengniu/GVIN (TensorFlow)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks · Bayesian Modeling and Causal Inference

Methods: Q-Learning · Convolution