Speeding up Convolutional Neural Networks By Exploiting the Sparsity of   Rectifier Units

Shaohuai Shi; Xiaowen Chu

arXiv:1704.07724·cs.CV·May 16, 2017·33 cites

Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units

Shaohuai Shi, Xiaowen Chu

PDF

Open Access

TL;DR

This paper leverages the high sparsity of ReLU activations in deep convolutional networks to accelerate convolution computations by skipping zero-valued neurons, achieving notable speedups on CPUs.

Contribution

It introduces a sparse convolution algorithm that exploits ReLU sparsity to improve computational efficiency in deep CNNs.

Findings

01

Achieves speedup when ReLU sparsity exceeds 0.9

02

Demonstrates efficiency gains on CPU architectures

03

Validates the approach on popular deep CNN architectures

Abstract

Rectifier neuron units (ReLUs) have been widely used in deep convolutional networks. An ReLU converts negative values to zeros, and does not change positive values, which leads to a high sparsity of neurons. In this work, we first examine the sparsity of the outputs of ReLUs in some popular deep convolutional architectures. And then we use the sparsity property of ReLUs to accelerate the calculation of convolution by skipping calculations of zero-valued neurons. The proposed sparse convolution algorithm achieves some speedup improvements on CPUs compared to the traditional matrix-matrix multiplication algorithm for convolution when the sparsity is not less than 0.9.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Machine Learning and ELM · Stochastic Gradient Optimization Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution