ReActNet: Towards Precise Binary Neural Network with Generalized   Activation Functions

Zechun Liu; Zhiqiang Shen; Marios Savvides; Kwang-Ting Cheng

arXiv:2003.03488·cs.CV·July 14, 2020·30 cites

ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions

Zechun Liu, Zhiqiang Shen, Marios Savvides, Kwang-Ting Cheng

PDF

Open Access 4 Repos

TL;DR

ReActNet introduces generalized activation functions and a distributional loss to significantly improve the accuracy of binary neural networks, narrowing the gap with real-valued models without extra computational cost.

Contribution

It proposes a novel baseline binary network with parameter-free shortcuts and generalized activation functions, achieving state-of-the-art accuracy on ImageNet.

Findings

01

Outperforms existing binary networks by 3.6-4.0% in top-1 accuracy.

02

Reduces the accuracy gap to real-valued networks to within 3%.

03

Achieves superior efficiency and accuracy trade-offs.

Abstract

In this paper, we propose several ideas for enhancing a binary network to close its accuracy gap from real-valued networks without incurring any additional computational cost. We first construct a baseline network by modifying and binarizing a compact real-valued network with parameter-free shortcuts, bypassing all the intermediate convolutional layers including the downsampling layers. This baseline network strikes a good trade-off between accuracy and efficiency, achieving superior performance than most of existing binary networks at approximately half of the computational cost. Through extensive experiments and analysis, we observed that the performance of binary networks is sensitive to activation distribution variations. Based on this important observation, we propose to generalize the traditional Sign and PReLU functions, denoted as RSign and RPReLU for the respective generalized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsParameterized ReLU