SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
Angshuman Parashar, Minsoo Rhu, Anurag Mukkara, Antonio Puglielli,, Rangharajan Venkatesan, Brucek Khailany, Joel Emer, Stephen W. Keckler, and, William J. Dally

TL;DR
SCNN is a specialized accelerator architecture that leverages sparsity in CNN weights and activations to significantly improve performance and energy efficiency, making it suitable for mobile and embedded applications.
Contribution
The paper introduces a novel sparse dataflow and hardware design for CNN acceleration that exploits zero-valued weights and activations to enhance efficiency.
Findings
SCNN achieves 2.7x performance improvement over dense accelerators.
SCNN reduces energy consumption by 2.3x compared to dense CNN accelerators.
The architecture effectively maintains and processes sparse data in compressed form.
Abstract
Convolutional Neural Networks (CNNs) have emerged as a fundamental technology for machine learning. High performance and extreme energy efficiency are critical for deployments of CNNs in a wide range of situations, especially mobile platforms such as autonomous vehicles, cameras, and electronic personal assistants. This paper introduces the Sparse CNN (SCNN) accelerator architecture, which improves performance and energy efficiency by exploiting the zero-valued weights that stem from network pruning during training and zero-valued activations that arise from the common ReLU operator applied during inference. Specifically, SCNN employs a novel dataflow that enables maintaining the sparse weights and activations in a compressed encoding, which eliminates unnecessary data transfers and reduces storage requirements. Furthermore, the SCNN dataflow facilitates efficient delivery of those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Processing Techniques and Applications · Digital Media Forensic Detection
MethodsPruning · *Communicated@Fast*How Do I Communicate to Expedia?
