ReDy: A Novel ReRAM-centric Dynamic Quantization Approach for Energy-efficient CNN Inference
Mohammad Sabri, Marc Riera, and Antonio Gonz\'alez

TL;DR
ReDy introduces a ReRAM-centric dynamic quantization method for CNN inference that adaptively reduces energy consumption by optimizing activation precision, significantly improving energy efficiency with minimal accuracy loss.
Contribution
The paper presents a novel hardware accelerator that dynamically adjusts activation quantization levels based on statistical analysis, enhancing energy efficiency in ReRAM-based DNN accelerators.
Findings
ReDy achieves 13% average energy savings over existing accelerators.
It reduces the number of analog-to-digital conversions needed.
ReDy maintains negligible accuracy loss with minimal area overhead.
Abstract
The primary operation in DNNs is the dot product of quantized input activations and weights. Prior works have proposed the design of memory-centric architectures based on the Processing-In-Memory (PIM) paradigm. Resistive RAM (ReRAM) technology is especially appealing for PIM-based DNN accelerators due to its high density to store weights, low leakage energy, low read latency, and high performance capabilities to perform the DNN dot-products massively in parallel within the ReRAM crossbars. However, the main bottleneck of these architectures is the energy-hungry analog-to-digital conversions (ADCs) required to perform analog computations in-ReRAM, which penalizes the efficiency and performance benefits of PIM. To improve energy-efficiency of in-ReRAM analog dot-product computations we present ReDy, a hardware accelerator that implements a ReRAM-centric Dynamic quantization scheme to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Applications
