BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis   Combination for Deep Neural Networks

Amro Eldebiky; Grace Li Zhang; Xunzhao Yin; Cheng Zhuo; Ing-Chao Lin,; Ulf Schlichtmann; Bing Li

arXiv:2407.03738·eess.SY·July 8, 2024

BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks

Amro Eldebiky, Grace Li Zhang, Xunzhao Yin, Cheng Zhuo, Ing-Chao Lin,, Ulf Schlichtmann, Bing Li

PDF

Open Access

TL;DR

BasisN enables efficient in-memory computing for large-scale DNNs on RRAM-based accelerators by using shared basis vectors, eliminating reprogramming and significantly reducing inference energy and time.

Contribution

The paper introduces a novel basis representation for DNN kernels that allows reprogramming-free acceleration on RRAM crossbars, improving efficiency for large models.

Findings

01

Reduced inference cycles and energy-delay to below 1% of reprogramming methods

02

Achieved efficient processing of large-scale DNNs like DenseNet and ResNet

03

Negligible training and hardware costs

Abstract

Deep neural networks (DNNs) have made breakthroughs in various fields including image recognition and language processing. DNNs execute hundreds of millions of multiply-and-accumulate (MAC) operations. To efficiently accelerate such computations, analog in-memory-computing platforms have emerged leveraging emerging devices such as resistive RAM (RRAM). However, such accelerators face the hurdle of being required to have sufficient on-chip crossbars to hold all the weights of a DNN. Otherwise, RRAM cells in the crossbars need to be reprogramed to process further layers, which causes huge time/energy overhead due to the extremely slow writing and verification of the RRAM cells. As a result, it is still not possible to deploy such accelerators to process large-scale DNNs in industry. To address this problem, we propose the BasisN framework to accelerate DNNs on any number of available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Dropout · Dense Connections · Batch Normalization · Dense Block · Max Pooling · Softmax · 1x1 Convolution · Global Average Pooling