BitWave: Exploiting Column-Based Bit-Level Sparsity for Deep Learning Acceleration

Man Shi; Vikram Jain; Antony Joseph; Maurice Meijer; Marian Verhelst

arXiv:2507.12444·eess.SY·July 17, 2025

BitWave: Exploiting Column-Based Bit-Level Sparsity for Deep Learning Acceleration

Man Shi, Vikram Jain, Antony Joseph, Maurice Meijer, Marian Verhelst

PDF

TL;DR

BitWave introduces a novel architecture leveraging structured bit-level sparsity and dynamic dataflow to significantly accelerate deep neural network computations while reducing memory and power consumption.

Contribution

The paper proposes the 'bit-column-serial' approach and the BitWave architecture, enabling efficient exploitation of structured bit-level sparsity in DNNs with post-training optimization.

Findings

01

Achieves up to 13.25x speedup over state-of-the-art accelerators.

02

Realizes 7.71x higher efficiency in DNN processing.

03

Consumes 17.56 mW power in a 16nm process.

Abstract

Bit-serial computation facilitates bit-wise sequential data processing, offering numerous benefits, such as a reduced area footprint and dynamically-adaptive computational precision. It has emerged as a prominent approach, particularly in leveraging bit-level sparsity in Deep Neural Networks (DNNs). However, existing bit-serial accelerators exploit bit-level sparsity to reduce computations by skipping zero bits, but they suffer from inefficient memory accesses due to the irregular indices of the non-zero bits. As memory accesses typically are the dominant contributor to DNN accelerator performance, this paper introduces a novel computing approach called "bit-column-serial" and a compatible architecture design named "BitWave." BitWave harnesses the advantages of the "bit-column-serial" approach, leveraging structured bit-level sparsity in combination with dynamic dataflow techniques.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.