A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN Accelerator
Weiping Yang, Shilin Zhou, Hui Xu, Yujiao Nie, Qimin Zhou, Zhiwei Li, Changlin Chen

TL;DR
This paper introduces a bit-level weight reordering strategy for RRAM-based neural network accelerators that enhances weight sparsity utilization, leading to significant performance and energy efficiency improvements.
Contribution
The paper proposes a novel bit reordering method that exploits bit similarity and sparsity to improve weight mapping in RRAM-based accelerators, enabling better integration of CIM and sparsity techniques.
Findings
61.24% average performance improvement
1.51x-2.52x energy savings
Effective weight compression with minimal overhead
Abstract
Compute-in-Memory (CIM) and weight sparsity are two effective techniques to reduce data movement during Neural Network (NN) inference. However, they can hardly be employed in the same accelerator simultaneously because CIM requires structural compute patterns which are disrupted in sparse NNs. In this paper, we partially solve this issue by proposing a bit level weight reordering strategy which can realize compact mapping of sparse NN weight matrices onto Resistive Random Access Memory (RRAM) based NN Accelerators (RRAM-Acc). In specific, when weights are mapped to RRAM crossbars in a binary complement manner, we can observe that, which can also be mathematically proven, bit-level sparsity and similarity commonly exist in the crossbars. The bit reordering method treats bit sparsity as a special case of bit similarity, reserve only one column in a pair of columns that have identical bit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices
