Latent Weights Do Not Exist: Rethinking Binarized Neural Network   Optimization

Koen Helwegen; James Widdicombe; Lukas Geiger; Zechun Liu; Kwang-Ting; Cheng; Roeland Nusselder

arXiv:1906.02107·cs.LG·November 7, 2019·38 cites

Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization

Koen Helwegen, James Widdicombe, Lukas Geiger, Zechun Liu, Kwang-Ting, Cheng, Roeland Nusselder

PDF

Open Access 3 Repos

TL;DR

This paper challenges the traditional view of latent weights in Binarized Neural Networks, proposing they serve as inertia rather than real weights, and introduces a new optimizer, Bop, tailored for BNN training.

Contribution

It redefines the role of latent weights in BNNs as inertia and introduces Bop, the first optimizer specifically designed for BNNs, improving training understanding and performance.

Findings

01

Bop outperforms existing optimizers on CIFAR-10 and ImageNet.

02

Redefining latent weights as inertia enhances BNN optimization understanding.

03

The new perspective facilitates future improvements in BNN training methods.

Abstract

Optimization of Binarized Neural Networks (BNNs) currently relies on real-valued latent weights to accumulate small update steps. In this paper, we argue that these latent weights cannot be treated analogously to weights in real-valued networks. Instead their main role is to provide inertia during training. We interpret current methods in terms of inertia and provide novel insights into the optimization of BNNs. We subsequently introduce the first optimizer specifically designed for BNNs, Binary Optimizer (Bop), and demonstrate its performance on CIFAR-10 and ImageNet. Together, the redefinition of latent weights as inertia and the introduction of Bop enable a better understanding of BNN optimization and open up the way for further improvements in training methodologies for BNNs. Code is available at: https://github.com/plumerai/rethinking-bnn-optimization

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning