Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm
Zechun Liu, Baoyuan Wu, Wenhan Luo, Xin Yang, Wei Liu, and Kwang-Ting, Cheng

TL;DR
This paper introduces Bi-Real Net, a novel 1-bit CNN architecture with enhanced representational capacity and a specialized training algorithm, significantly improving accuracy on ImageNet while maintaining efficiency.
Contribution
The paper proposes a new Bi-Real Net architecture with identity shortcuts and a tailored training method to close the performance gap of 1-bit CNNs to real-valued models.
Findings
Achieves 56.4% top-1 accuracy with 18 layers on ImageNet.
Outperforms state-of-the-art 1-bit CNNs like XNOR Net by up to 10%.
Maintains low computational cost and memory efficiency.
Abstract
In this work, we study the 1-bit convolutional neural networks (CNNs), of which both the weights and activations are binary. While being efficient, the classification accuracy of the current 1-bit CNNs is much worse compared to their counterpart real-valued CNN models on the large-scale dataset, like ImageNet. To minimize the performance gap between the 1-bit and real-valued CNN models, we propose a novel model, dubbed Bi-Real net, which connects the real activations (after the 1-bit convolution and/or BatchNorm layer, before the sign function) to activations of the consecutive block, through an identity shortcut. Consequently, compared to the standard 1-bit CNN, the representational capability of the Bi-Real net is significantly enhanced and the additional cost on computation is negligible. Moreover, we develop a specific training algorithm including three technical novelties for 1-…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution
