Fast Sparse ConvNets

Erich Elsen; Marat Dukhan; Trevor Gale; Karen Simonyan

arXiv:1911.09723·cs.CV·November 25, 2019

Fast Sparse ConvNets

Erich Elsen, Marat Dukhan, Trevor Gale, Karen Simonyan

PDF

5 Repos 1 Video

TL;DR

This paper introduces efficient sparse kernels for neural networks, demonstrating that replacing dense primitives with sparse ones significantly improves real-world efficiency and accuracy on mobile devices, and provides open-source implementations.

Contribution

The authors develop and open-source efficient sparse kernels for ARM and WebAssembly, enabling sparse neural networks to outperform dense models in real-world scenarios.

Findings

01

Sparse networks outperform dense counterparts by 1.3-2.4x on Snapdragon 835.

02

Sparse versions of MobileNet and EfficientNet surpass dense baselines in efficiency and accuracy.

03

Open-source sparse kernels facilitate wider adoption of sparsity in deep learning architectures.

Abstract

Historically, the pursuit of efficient inference has been one of the driving forces behind research into new deep learning architectures and building blocks. Some recent examples include: the squeeze-and-excitation module, depthwise separable convolutions in Xception, and the inverted bottleneck in MobileNet v2. Notably, in all of these cases, the resulting building blocks enabled not only higher efficiency, but also higher accuracy, and found wide adoption in the field. In this work, we further expand the arsenal of efficient building blocks for neural network architectures; but instead of combining standard primitives (such as convolution), we advocate for the replacement of these dense primitives with their sparse counterparts. While the idea of using sparsity to decrease the parameter count is not new, the conventional wisdom is that this reduction in theoretical FLOPs does not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Fast Sparse ConvNets· youtube

Taxonomy

MethodsRMSProp · Tether Customer Service Number +1-833-534-1729 · MobileNetV1 · Residual Connection · Global Average Pooling · Max Pooling · Softmax · Depthwise Convolution · Pointwise Convolution · How Do I Get a Human at Expedia?+1-805>330>4056.