SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

Angshuman Parashar; Minsoo Rhu; Anurag Mukkara; Antonio Puglielli,; Rangharajan Venkatesan; Brucek Khailany; Joel Emer; Stephen W. Keckler; and; William J. Dally

arXiv:1708.04485·cs.NE·August 16, 2017·125 cites

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

Angshuman Parashar, Minsoo Rhu, Anurag Mukkara, Antonio Puglielli,, Rangharajan Venkatesan, Brucek Khailany, Joel Emer, Stephen W. Keckler, and, William J. Dally

PDF

Open Access

TL;DR

SCNN is a specialized accelerator architecture that leverages sparsity in CNN weights and activations to significantly improve performance and energy efficiency, making it suitable for mobile and embedded applications.

Contribution

The paper introduces a novel sparse dataflow and hardware design for CNN acceleration that exploits zero-valued weights and activations to enhance efficiency.

Findings

01

SCNN achieves 2.7x performance improvement over dense accelerators.

02

SCNN reduces energy consumption by 2.3x compared to dense CNN accelerators.

03

The architecture effectively maintains and processes sparse data in compressed form.

Abstract

Convolutional Neural Networks (CNNs) have emerged as a fundamental technology for machine learning. High performance and extreme energy efficiency are critical for deployments of CNNs in a wide range of situations, especially mobile platforms such as autonomous vehicles, cameras, and electronic personal assistants. This paper introduces the Sparse CNN (SCNN) accelerator architecture, which improves performance and energy efficiency by exploiting the zero-valued weights that stem from network pruning during training and zero-valued activations that arise from the common ReLU operator applied during inference. Specifically, SCNN employs a novel dataflow that enables maintaining the sparse weights and activations in a compressed encoding, which eliminates unnecessary data transfers and reduces storage requirements. Furthermore, the SCNN dataflow facilitates efficient delivery of those…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Image Processing Techniques and Applications · Digital Media Forensic Detection

MethodsPruning · *Communicated@Fast*How Do I Communicate to Expedia?