TL;DR
This paper introduces efficient sparse kernels for neural networks, demonstrating that replacing dense primitives with sparse ones significantly improves real-world efficiency and accuracy on mobile devices, and provides open-source implementations.
Contribution
The authors develop and open-source efficient sparse kernels for ARM and WebAssembly, enabling sparse neural networks to outperform dense models in real-world scenarios.
Findings
Sparse networks outperform dense counterparts by 1.3-2.4x on Snapdragon 835.
Sparse versions of MobileNet and EfficientNet surpass dense baselines in efficiency and accuracy.
Open-source sparse kernels facilitate wider adoption of sparsity in deep learning architectures.
Abstract
Historically, the pursuit of efficient inference has been one of the driving forces behind research into new deep learning architectures and building blocks. Some recent examples include: the squeeze-and-excitation module, depthwise separable convolutions in Xception, and the inverted bottleneck in MobileNet v2. Notably, in all of these cases, the resulting building blocks enabled not only higher efficiency, but also higher accuracy, and found wide adoption in the field. In this work, we further expand the arsenal of efficient building blocks for neural network architectures; but instead of combining standard primitives (such as convolution), we advocate for the replacement of these dense primitives with their sparse counterparts. While the idea of using sparsity to decrease the parameter count is not new, the conventional wisdom is that this reduction in theoretical FLOPs does not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Fast Sparse ConvNets· youtube
Taxonomy
MethodsRMSProp · Tether Customer Service Number +1-833-534-1729 · MobileNetV1 · Residual Connection · Global Average Pooling · Max Pooling · Softmax · Depthwise Convolution · Pointwise Convolution · How Do I Get a Human at Expedia?+1-805>330>4056.
