Efficient Sparse-Winograd Convolutional Neural Networks
Xingyu Liu, Jeff Pool, Song Han, William J. Dally

TL;DR
This paper introduces modifications to Winograd convolution algorithms that enable the exploitation of sparsity in CNNs, significantly reducing computation while maintaining accuracy, especially on mobile devices.
Contribution
The authors propose moving ReLU into the Winograd domain and pruning weights there, enabling combined sparsity exploitation in Winograd-based CNNs.
Findings
Achieved up to 10.8x reduction in multiplications on ImageNet.
Maintained less than 0.1% accuracy loss with significant speedups.
Outperformed previous methods by 2.0x to 3.0x in efficiency.
Abstract
Convolutional Neural Networks (CNNs) are computationally intensive, which limits their application on mobile devices. Their energy is dominated by the number of multiplies needed to perform the convolutions. Winograd's minimal filtering algorithm (Lavin, 2015) and network pruning (Han et al., 2015) can reduce the operation count, but these two methods cannot be directly combined applying the Winograd transform fills in the sparsity in both the weights and the activations. We propose two modifications to Winograd-based CNNs to enable these methods to exploit sparsity. First, we move the ReLU operation into the Winograd domain to increase the sparsity of the transformed activations. Second, we prune the weights in the Winograd domain to exploit static weight sparsity. For models on CIFAR-10, CIFAR-100 and ImageNet datasets, our method reduces the number of multiplications by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsPruning · *Communicated@Fast*How Do I Communicate to Expedia?
