Training Binary Neural Networks with Real-to-Binary Convolutions

Brais Martinez; Jing Yang; Adrian Bulat; Georgios; Tzimiropoulos

arXiv:2003.11535·cs.CV·March 26, 2020·69 cites

Training Binary Neural Networks with Real-to-Binary Convolutions

Brais Martinez, Jing Yang, Adrian Bulat, Georgios, Tzimiropoulos

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to train binary neural networks that nearly match the accuracy of full-precision models by minimizing discrepancies between binary and real-valued convolutions, achieving state-of-the-art results on ImageNet.

Contribution

The paper proposes a novel approach combining loss functions and data-driven re-scaling to improve binary neural network accuracy, narrowing the gap with real-valued networks.

Findings

01

Achieves within 3-5% of full-precision accuracy on ImageNet.

02

Surpasses previous binary network methods by over 5% top-1 accuracy.

03

Reduces the accuracy gap to less than 3-5% on CIFAR-100 and ImageNet.

Abstract

This paper shows how to train binary networks to within a few percent points ( $\sim 3 - 5%$ ) of the full precision counterpart. We first show how to build a strong baseline, which already achieves state-of-the-art accuracy, by combining recently proposed advances and carefully adjusting the optimization procedure. Secondly, we show that by attempting to minimize the discrepancy between the output of the binary and the corresponding real-valued convolution, additional significant accuracy gains can be obtained. We materialize this idea in two complementary ways: (1) with a loss function, during training, by matching the spatial attention maps computed at the output of the binary and real-valued convolutions, and (2) in a data-driven manner, by using the real-valued activations, available during inference prior to the binarization process, for re-scaling the activations right after the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brais-martinez/real2binary
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques