Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao

TL;DR
Sub-bit Neural Networks (SNNs) introduce a kernel-aware binary quantization method that significantly compresses and accelerates BNNs, achieving notable speed-ups and compression ratios on visual recognition tasks.
Contribution
The paper proposes a novel kernel-aware optimization framework for binary quantization, exploiting kernel subset distributions to improve BNN compression and acceleration.
Findings
SNNs achieve over 3x speed-up on ImageNet with ResNet models.
SNNs reduce model size by approximately 1.8 times compared to traditional BNNs.
Experimental validation on FPGA demonstrates practical deployment benefits.
Abstract
In the low-bit quantization field, training Binary Neural Networks (BNNs) is the extreme solution to ease the deployment of deep models on resource-constrained devices, having the lowest storage cost and significantly cheaper bit-wise operations compared to 32-bit floating-point counterparts. In this paper, we introduce Sub-bit Neural Networks (SNNs), a new type of binary quantization design tailored to compress and accelerate BNNs. SNNs are inspired by an empirical observation, showing that binary kernels learnt at convolutional layers of a BNN model are likely to be distributed over kernel subsets. As a result, unlike existing methods that binarize weights one by one, SNNs are trained with a kernel-aware optimization framework, which exploits binary quantization in the fine-grained convolutional kernel space. Specifically, our method includes a random sampling step generating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
