ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions
Zechun Liu, Zhiqiang Shen, Marios Savvides, Kwang-Ting Cheng

TL;DR
ReActNet introduces generalized activation functions and a distributional loss to significantly improve the accuracy of binary neural networks, narrowing the gap with real-valued models without extra computational cost.
Contribution
It proposes a novel baseline binary network with parameter-free shortcuts and generalized activation functions, achieving state-of-the-art accuracy on ImageNet.
Findings
Outperforms existing binary networks by 3.6-4.0% in top-1 accuracy.
Reduces the accuracy gap to real-valued networks to within 3%.
Achieves superior efficiency and accuracy trade-offs.
Abstract
In this paper, we propose several ideas for enhancing a binary network to close its accuracy gap from real-valued networks without incurring any additional computational cost. We first construct a baseline network by modifying and binarizing a compact real-valued network with parameter-free shortcuts, bypassing all the intermediate convolutional layers including the downsampling layers. This baseline network strikes a good trade-off between accuracy and efficiency, achieving superior performance than most of existing binary networks at approximately half of the computational cost. Through extensive experiments and analysis, we observed that the performance of binary networks is sensitive to activation distribution variations. Based on this important observation, we propose to generalize the traditional Sign and PReLU functions, denoted as RSign and RPReLU for the respective generalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsParameterized ReLU
