BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks
Amro Eldebiky, Grace Li Zhang, Xunzhao Yin, Cheng Zhuo, Ing-Chao Lin,, Ulf Schlichtmann, Bing Li

TL;DR
BasisN enables efficient in-memory computing for large-scale DNNs on RRAM-based accelerators by using shared basis vectors, eliminating reprogramming and significantly reducing inference energy and time.
Contribution
The paper introduces a novel basis representation for DNN kernels that allows reprogramming-free acceleration on RRAM crossbars, improving efficiency for large models.
Findings
Reduced inference cycles and energy-delay to below 1% of reprogramming methods
Achieved efficient processing of large-scale DNNs like DenseNet and ResNet
Negligible training and hardware costs
Abstract
Deep neural networks (DNNs) have made breakthroughs in various fields including image recognition and language processing. DNNs execute hundreds of millions of multiply-and-accumulate (MAC) operations. To efficiently accelerate such computations, analog in-memory-computing platforms have emerged leveraging emerging devices such as resistive RAM (RRAM). However, such accelerators face the hurdle of being required to have sufficient on-chip crossbars to hold all the weights of a DNN. Otherwise, RRAM cells in the crossbars need to be reprogramed to process further layers, which causes huge time/energy overhead due to the extremely slow writing and verification of the RRAM cells. As a result, it is still not possible to deploy such accelerators to process large-scale DNNs in industry. To address this problem, we propose the BasisN framework to accelerate DNNs on any number of available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Dropout · Dense Connections · Batch Normalization · Dense Block · Max Pooling · Softmax · 1x1 Convolution · Global Average Pooling
