BD-Net: Has Depth-Wise Convolution Ever Been Applied in Binary Neural Networks?
DoYoung Kim, Jin-Seop Lee, Noo-ri Kim, SungJoon Lee, Jee-Hyong Lee

TL;DR
This paper introduces a novel approach to binarize depth-wise convolutions in Binary Neural Networks, significantly improving efficiency and accuracy across multiple datasets by combining a 1.58-bit convolution and residual stabilization techniques.
Contribution
It presents the first successful binarization of depth-wise convolutions in BNNs, enhancing expressiveness and training stability with innovative convolution and residual methods.
Findings
Achieves state-of-the-art BNN performance on ImageNet with MobileNet V1.
Outperforms prior methods on CIFAR-10, CIFAR-100, STL-10, Tiny ImageNet, and Oxford Flowers 102.
Improves accuracy by up to 9.3 percentage points across datasets.
Abstract
Recent advances in model compression have highlighted the potential of low-bit precision techniques, with Binary Neural Networks (BNNs) attracting attention for their extreme efficiency. However, extreme quantization in BNNs limits representational capacity and destabilizes training, posing significant challenges for lightweight architectures with depth-wise convolutions. To address this, we propose a 1.58-bit convolution to enhance expressiveness and a pre-BN residual connection to stabilize optimization by improving the Hessian condition number. These innovations enable, to the best of our knowledge, the first successful binarization of depth-wise convolutions in BNNs. Our method achieves 33M OPs on ImageNet with MobileNet V1, establishing a new state-of-the-art in BNNs by outperforming prior methods with comparable OPs. Moreover, it consistently outperforms existing methods across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
