Deciding How to Decide: Dynamic Routing in Artificial Neural Networks

Mason McGill; Pietro Perona

arXiv:1703.06217·stat.ML·September 14, 2017·51 cites

Deciding How to Decide: Dynamic Routing in Artificial Neural Networks

Mason McGill, Pietro Perona

PDF

Open Access 1 Repo

TL;DR

This paper introduces and evaluates three strategies for training neural networks with dynamic routing, where different inputs follow different paths, leading to specialized processing and improved performance under fixed computational constraints.

Contribution

It systematically compares three dynamic routing strategies and demonstrates their effectiveness in creating specialized layers and improving image classification performance.

Findings

01

Layers and branches become category-specific.

02

Dynamically-routed networks outperform static ones at fixed computational budgets.

03

Different routing strategies have comparable qualitative network structures.

Abstract

We propose and systematically evaluate three strategies for training dynamically-routed artificial neural networks: graphs of learned transformations through which different input signals may take different paths. Though some approaches have advantages over others, the resulting networks are often qualitatively similar. We find that, in dynamically-routed networks trained to classify images, layers and branches become specialized to process distinct categories of images. Additionally, given a fixed computational budget, dynamically-routed networks tend to perform better than comparable statically-routed networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MasonMcGill/multipath-nn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications