Resilient Binary Neural Network
Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang,, Peng Gao, Jinhu Lv

TL;DR
This paper introduces ReBNN, a resilient training method for binary neural networks that mitigates weight oscillation by parameterizing the scaling factor and using an adaptive loss, leading to improved performance across vision and NLP tasks.
Contribution
The paper proposes a novel approach to control weight oscillation in BNNs by parameterizing the scaling factor and incorporating a weighted reconstruction loss, with theoretical and empirical validation.
Findings
ReBNN achieves 66.9% Top-1 accuracy on ImageNet with ResNet-18.
ReBNN outperforms prior BNN methods across multiple models and tasks.
The method effectively mitigates weight oscillation during training.
Abstract
Binary neural networks (BNNs) have received ever-increasing popularity for their great capability of reducing storage burden as well as quickening inference time. However, there is a severe performance drop compared with real-valued networks, due to its intrinsic frequent weight oscillation during training. In this paper, we introduce a Resilient Binary Neural Network (ReBNN) to mitigate the frequent oscillation for better BNNs' training. We identify that the weight oscillation mainly stems from the non-parametric scaling factor. To address this issue, we propose to parameterize the scaling factor and introduce a weighted reconstruction loss to build an adaptive training objective. For the first time, we show that the weight oscillation is controlled by the balanced parameter attached to the reconstruction loss, which provides a theoretical foundation to parameterize it in back…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Average Pooling · Linear Layer · Bottleneck Residual Block · Residual Block · Refunds@Expedia|||How do I get a full refund from Expedia?
