CoDR: Computation and Data Reuse Aware CNN Accelerator

Alireza Khadem; Haojie Ye; Trevor Mudge

arXiv:2104.09798·cs.AR·April 21, 2021

CoDR: Computation and Data Reuse Aware CNN Accelerator

Alireza Khadem, Haojie Ye, Trevor Mudge

PDF

Open Access

TL;DR

CoDR is a CNN accelerator that enhances computation and data reuse by exploiting sparsity, similarity, and repetition, significantly reducing memory access and energy consumption.

Contribution

It introduces Universal Computation Reuse and a customized encoding scheme, optimizing memory access and energy efficiency in CNN accelerators.

Findings

01

Reduces SRAM access by up to 8x

02

Consumes significantly less energy than recent accelerators

03

Exploits weight sparsity, repetition, and similarity simultaneously

Abstract

Computation and Data Reuse is critical for the resource-limited Convolutional Neural Network (CNN) accelerators. This paper presents Universal Computation Reuse to exploit weight sparsity, repetition, and similarity simultaneously in a convolutional layer. Moreover, CoDR decreases the cost of weight memory access by proposing a customized Run-Length Encoding scheme and the number of memory accesses to the intermediate results by introducing an input and output stationary dataflow. Compared to two recent compressed CNN accelerators with the same area of 2.85 mm^2, CoDR decreases SRAM access by 5.08x and 7.99x, and consumes 3.76x and 6.84x less energy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices