RAPIDNN: In-Memory Deep Neural Network Acceleration Framework

Mohsen Imani; Mohammad Samragh; Yeseong Kim; Saransh Gupta; Farinaz; Koushanfar; Tajana Rosing

arXiv:1806.05794·cs.NE·April 15, 2019·47 cites

RAPIDNN: In-Memory Deep Neural Network Acceleration Framework

Mohsen Imani, Mohammad Samragh, Yeseong Kim, Saransh Gupta, Farinaz, Koushanfar, Tajana Rosing

PDF

Open Access

TL;DR

RAPIDNN is an in-memory DNN acceleration framework that minimizes data movement by processing all operations within memory, achieving significant energy and speed improvements over existing accelerators with minimal accuracy loss.

Contribution

It introduces a novel in-memory processing framework for DNNs that reinterprets models and maps them into specialized memory-based accelerators, reducing data transfer costs.

Findings

01

68.4x energy efficiency improvement

02

48.1x speedup over state-of-the-art accelerators

03

less than 0.3% accuracy loss

Abstract

Deep neural networks (DNN) have demonstrated effectiveness for various applications such as image processing, video segmentation, and speech recognition. Running state-of-the-art DNNs on current systems mostly relies on either generalpurpose processors, ASIC designs, or FPGA accelerators, all of which suffer from data movements due to the limited onchip memory and data transfer bandwidth. In this work, we propose a novel framework, called RAPIDNN, which processes all DNN operations within the memory to minimize the cost of data movement. To enable in-memory processing, RAPIDNN reinterprets a DNN model and maps it into a specialized accelerator, which is designed using non-volatile memory blocks that model four fundamental DNN operations, i.e., multiplication, addition, activation functions, and pooling. The framework extracts representative operands of a DNN model, e.g., weights and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices