Bimodal Distributed Binarized Neural Networks
Tal Rozen, Moshe Kimhi, Brian Chmiel, Avi Mendelson, Chaim Baskin

TL;DR
This paper introduces a novel binarization method for neural networks called Bi-Modal Distributed Binarization, which uses kurtosis regularization and a training scheme to improve performance and reduce error in binary neural networks.
Contribution
It proposes a new binarization technique that enforces a bi-modal weight distribution using kurtosis regularization and a training scheme called WDM, enhancing binary neural network accuracy.
Findings
Outperforms state-of-the-art on CIFAR-10 and ImageNet
Reduces generalization error in BNNs
Creates robust binary feature maps
Abstract
Binary Neural Networks (BNNs) are an extremely promising method to reduce deep neural networks' complexity and power consumption massively. Binarization techniques, however, suffer from ineligible performance degradation compared to their full-precision counterparts. Prior work mainly focused on strategies for sign function approximation during forward and backward phases to reduce the quantization error during the binarization process. In this work, we propose a Bi-Modal Distributed binarization method (\methodname{}). That imposes bi-modal distribution of the network weights by kurtosis regularization. The proposed method consists of a training scheme that we call Weight Distribution Mimicking (WDM), which efficiently imitates the full-precision network weight distribution to their binary counterpart. Preserving this distribution during binarization-aware training creates robust and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
