The Power of Sparsity in Convolutional Neural Networks

Soravit Changpinyo; Mark Sandler; Andrey Zhmoginov

arXiv:1702.06257·cs.CV·February 22, 2017·92 cites

The Power of Sparsity in Convolutional Neural Networks

Soravit Changpinyo, Mark Sandler, Andrey Zhmoginov

PDF

Open Access

TL;DR

This paper explores the use of sparsity in convolutional neural networks, demonstrating that channel-wise sparse connections can significantly reduce computational costs while maintaining accuracy, especially in large architectures.

Contribution

It introduces a generalized channel-wise sparse connection strategy for convolutional layers, improving efficiency over simple channel reduction methods.

Findings

01

Sparse connections outperform baseline channel reduction.

02

Significant computational and memory savings achieved.

03

Effective in large networks like VGG and Inception V3.

Abstract

Deep convolutional networks are well-known for their high computational and memory demands. Given limited resources, how does one design a network that balances its size, training time, and prediction accuracy? A surprisingly effective approach to trade accuracy for size and speed is to simply reduce the number of channels in each convolutional layer by a fixed fraction and retrain the network. In many cases this leads to significantly smaller networks with only minimal changes to accuracy. In this paper, we take a step further by empirically examining a strategy for deactivating connections between filters in convolutional layers in a way that allows us to harvest savings both in run-time and memory for many network architectures. More specifically, we generalize 2D convolution to use a channel-wise sparse connection structure and show that this leads to significantly better results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Dropout · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Ethereum Customer Service Number +1-833-534-1729 · Convolution