FTBNN: Rethinking Non-linearity for 1-bit CNNs and Going Beyond
Zhuo Su, Linpu Fang, Deke Guo, Dewen Hu, Matti Pietik\"ainen, Li Liu

TL;DR
This paper introduces FTBNN, a novel approach to binary neural networks that emphasizes the importance of non-linearity, leading to state-of-the-art accuracy on ImageNet and enhanced model capacity without increasing computational cost.
Contribution
We propose a rethinking of non-linearity in BNNs, achieving improved accuracy and efficiency, and demonstrate potential for further compression and capacity enhancement.
Findings
Achieved state-of-the-art accuracy on ImageNet with BNNs.
Enhanced BNN capacity using group execution.
Improved accuracy by 4-5% with less computation.
Abstract
Binary neural networks (BNNs), where both weights and activations are binarized into 1 bit, have been widely studied in recent years due to its great benefit of highly accelerated computation and substantially reduced memory footprint that appeal to the development of resource constrained devices. In contrast to previous methods tending to reduce the quantization error for training BNN structures, we argue that the binarized convolution process owns an increasing linearity towards the target of minimizing such error, which in turn hampers BNN's discriminative ability. In this paper, we re-investigate and tune proper non-linear modules to fix that contradiction, leading to a strong baseline which achieves state-of-the-art performance on the large-scale ImageNet dataset in terms of accuracy and training efficiency. To go further, we find that the proposed BNN model still has much…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Neural Networks and Applications
MethodsConvolution
