Cnvlutin2: Ineffectual-Activation-and-Weight-Free Deep Neural Network   Computing

Patrick Judd; Alberto Delmas; Sayeh Sharify; Andreas Moshovos

arXiv:1705.00125·cs.LG·May 2, 2017·33 cites

Cnvlutin2: Ineffectual-Activation-and-Weight-Free Deep Neural Network Computing

Patrick Judd, Alberto Delmas, Sayeh Sharify, Andreas Moshovos

PDF

Open Access

TL;DR

Cnvlutin2 introduces advanced techniques to efficiently skip ineffectual activations and weights in deep neural network accelerators, reducing memory usage and energy consumption through novel encoding, memory access, and detection methods.

Contribution

The paper presents new encoding schemes, memory access strategies, and an extended architecture for skipping ineffectual activations and weights in DNN accelerators.

Findings

01

Reduced memory footprint for activations

02

Lower energy consumption in DNN processing

03

Effective skipping of ineffectual weights and activations

Abstract

We discuss several modifications and extensions over the previous proposed Cnvlutin (CNV) accelerator for convolutional and fully-connected layers of Deep Learning Network. We first describe different encodings of the activations that are deemed ineffectual. The encodings have different memory overhead and energy characteristics. We propose using a level of indirection when accessing activations from memory to reduce their memory footprint by storing only the effectual activations. We also present a modified organization that detects the activations that are deemed as ineffectual while fetching them from memory. This is different than the original design that instead detected them at the output of the preceding layer. Finally, we present an extended CNV that can also skip ineffectual weights.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ferroelectric and Negative Capacitance Devices · Machine Learning in Materials Science