BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration
Yuzong Chen, Jian Meng, Jae-sun Seo, Mohamed S. Abdelfattah

TL;DR
This paper introduces Bidirectional Bit sparsity (BBS), a novel method that leverages symmetrical bit pruning and a hardware accelerator to significantly improve deep learning efficiency, reducing model size, increasing speed, and saving energy.
Contribution
The paper proposes BBS, a new symmetric bit pruning technique combined with a hardware accelerator, enhancing bit-level sparsity and efficiency in deep neural network acceleration.
Findings
Achieves 1.66× reduction in model size with minimal accuracy loss
Provides up to 3.03× speedup over prior accelerators
Saves 2.44× energy compared to existing DNN hardware
Abstract
Bit-level sparsity methods skip ineffectual zero-bit operations and are typically applicable within bit-serial deep learning accelerators. This type of sparsity at the bit-level is especially interesting because it is both orthogonal and compatible with other deep neural network (DNN) efficiency methods such as quantization and pruning. In this work, we improve the practicality and efficiency of bitlevel sparsity through a novel algorithmic bit-pruning, averaging, and compression method, and a co-designed efficient bit-serial hardware accelerator. On the algorithmic side, we introduce bidirectional bit sparsity (BBS). The key insight of BBS is that we can leverage bit sparsity in a symmetrical way to prune either zero-bits or one-bits. This significantly improves the load balance of bit-serial computing and guarantees the level of sparsity to be more than 50%. On top of BBS, we further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · CCD and CMOS Imaging Sensors · Particle Detector Development and Performance
MethodsPruning
