RAPIDNN: In-Memory Deep Neural Network Acceleration Framework
Mohsen Imani, Mohammad Samragh, Yeseong Kim, Saransh Gupta, Farinaz, Koushanfar, Tajana Rosing

TL;DR
RAPIDNN is an in-memory DNN acceleration framework that minimizes data movement by processing all operations within memory, achieving significant energy and speed improvements over existing accelerators with minimal accuracy loss.
Contribution
It introduces a novel in-memory processing framework for DNNs that reinterprets models and maps them into specialized memory-based accelerators, reducing data transfer costs.
Findings
68.4x energy efficiency improvement
48.1x speedup over state-of-the-art accelerators
less than 0.3% accuracy loss
Abstract
Deep neural networks (DNN) have demonstrated effectiveness for various applications such as image processing, video segmentation, and speech recognition. Running state-of-the-art DNNs on current systems mostly relies on either generalpurpose processors, ASIC designs, or FPGA accelerators, all of which suffer from data movements due to the limited onchip memory and data transfer bandwidth. In this work, we propose a novel framework, called RAPIDNN, which processes all DNN operations within the memory to minimize the cost of data movement. To enable in-memory processing, RAPIDNN reinterprets a DNN model and maps it into a specialized accelerator, which is designed using non-volatile memory blocks that model four fundamental DNN operations, i.e., multiplication, addition, activation functions, and pooling. The framework extracts representative operands of a DNN model, e.g., weights and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices
