XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat, Georgios Tzimiropoulos

TL;DR
This paper introduces an improved training method for binary neural networks that learns a unified scale factor via backpropagation, resulting in significantly better accuracy on ImageNet compared to previous methods.
Contribution
It proposes a discriminatively learned scale factor for binary neural networks, replacing analytically calculated ones, and explores various shape constructions to enhance accuracy.
Findings
Achieves up to 6% higher accuracy on ImageNet.
Learned scale factors outperform analytically calculated ones.
Method maintains computational efficiency while improving performance.
Abstract
This paper proposes an improved training algorithm for binary neural networks in which both weights and activations are binary numbers. A key but fairly overlooked feature of the current state-of-the-art method of XNOR-Net is the use of analytically calculated real-valued scaling factors for re-weighting the output of binary convolutions. We argue that analytic calculation of these factors is sub-optimal. Instead, in this work, we make the following contributions: (a) we propose to fuse the activation and weight scaling factors into a single one that is learned discriminatively via backpropagation. (b) More importantly, we explore several ways of constructing the shape of the scale factors while keeping the computational budget fixed. (c) We empirically measure the accuracy of our approximations and show that they are significantly more accurate than the analytically calculated one. (d)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
