Connectivity Learning in Multi-Branch Networks

Karim Ahmed; Lorenzo Torresani

arXiv:1709.09582·cs.LG·December 8, 2017·25 cites

Connectivity Learning in Multi-Branch Networks

Karim Ahmed, Lorenzo Torresani

PDF

Open Access

TL;DR

This paper introduces a method to automatically learn the optimal connectivity between branches in multi-branch neural networks, leading to improved classification accuracy over fixed-connection architectures.

Contribution

It proposes an algorithm that learns multi-branch network connections jointly with weights, removing predefined design choices and optimizing for the end task.

Findings

01

Achieves higher accuracy than ResNeXt on multiple datasets.

02

Automatically learns branch connectivity, reducing manual design effort.

03

Improves performance without increasing model capacity.

Abstract

While much of the work in the design of convolutional networks over the last five years has revolved around the empirical investigation of the importance of depth, filter sizes, and number of feature channels, recent studies have shown that branching, i.e., splitting the computation along parallel but distinct threads and then aggregating their outputs, represents a new promising dimension for significant improvements in performance. To combat the complexity of design choices in multi-branch architectures, prior work has adopted simple strategies, such as a fixed branching factor, the same input being fed to all parallel branches, and an additive combination of the outputs produced by all branches at aggregation points. In this work we remove these predefined choices and propose an algorithm to learn the connections between branches in the network. Instead of being chosen a priori by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and ELM