BitWave: Exploiting Column-Based Bit-Level Sparsity for Deep Learning Acceleration
Man Shi, Vikram Jain, Antony Joseph, Maurice Meijer, Marian Verhelst

TL;DR
BitWave introduces a novel architecture leveraging structured bit-level sparsity and dynamic dataflow to significantly accelerate deep neural network computations while reducing memory and power consumption.
Contribution
The paper proposes the 'bit-column-serial' approach and the BitWave architecture, enabling efficient exploitation of structured bit-level sparsity in DNNs with post-training optimization.
Findings
Achieves up to 13.25x speedup over state-of-the-art accelerators.
Realizes 7.71x higher efficiency in DNN processing.
Consumes 17.56 mW power in a 16nm process.
Abstract
Bit-serial computation facilitates bit-wise sequential data processing, offering numerous benefits, such as a reduced area footprint and dynamically-adaptive computational precision. It has emerged as a prominent approach, particularly in leveraging bit-level sparsity in Deep Neural Networks (DNNs). However, existing bit-serial accelerators exploit bit-level sparsity to reduce computations by skipping zero bits, but they suffer from inefficient memory accesses due to the irregular indices of the non-zero bits. As memory accesses typically are the dominant contributor to DNN accelerator performance, this paper introduces a novel computing approach called "bit-column-serial" and a compatible architecture design named "BitWave." BitWave harnesses the advantages of the "bit-column-serial" approach, leveraging structured bit-level sparsity in combination with dynamic dataflow techniques.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
