BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by   Coupling Binary Activations

Hyungjun Kim; Kyungsu Kim; Jinseok Kim; Jae-Joon Kim

arXiv:2002.06517·cs.LG·February 18, 2020·31 cites

BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by Coupling Binary Activations

Hyungjun Kim, Kyungsu Kim, Jinseok Kim, Jae-Joon Kim

PDF

Open Access 1 Repo

TL;DR

BinaryDuo introduces a novel training scheme coupling binary activations into a ternary form, effectively reducing gradient mismatch and improving performance of binary neural networks without increasing computational cost.

Contribution

The paper proposes BinaryDuo, a new training method that couples binary activations into a ternary form to address gradient mismatch in BNNs, outperforming existing methods.

Findings

01

BinaryDuo outperforms state-of-the-art BNNs on multiple benchmarks.

02

Using higher precision activations is more effective than modifying activation approximations.

03

Gradient mismatch can be better estimated with a smoothed loss function gradient.

Abstract

Binary Neural Networks (BNNs) have been garnering interest thanks to their compute cost reduction and memory savings. However, BNNs suffer from performance degradation mainly due to the gradient mismatch caused by binarizing activations. Previous works tried to address the gradient mismatch problem by reducing the discrepancy between activation functions used at forward pass and its differentiable approximation used at backward pass, which is an indirect measure. In this work, we use the gradient of smoothed loss function to better estimate the gradient mismatch in quantized neural network. Analysis using the gradient mismatch estimator indicates that using higher precision for activation is more effective than modifying the differentiable approximation of activation function. Based on the observation, we propose a new training scheme for binary activation networks called BinaryDuo in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Hyungjun-K1m/BinaryDuo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM