Iterative Training: Finding Binary Weight Deep Neural Networks with   Layer Binarization

Cheng-Chou Lan

arXiv:2111.07046·cs.LG·November 16, 2021

Iterative Training: Finding Binary Weight Deep Neural Networks with Layer Binarization

Cheng-Chou Lan

PDF

Open Access 1 Repo

TL;DR

This paper introduces an iterative training method for binary weight neural networks, emphasizing layer-wise binarization order and sensitivity pre-training to improve accuracy in deep networks.

Contribution

It proposes a novel iterative training approach with guided layer binarization and sensitivity pre-training to enhance binary neural network performance.

Findings

01

Layer binarization order affects accuracy.

02

Starting from partial binary weights improves results.

03

Guided binarization further enhances accuracy.

Abstract

In low-latency or mobile applications, lower computation complexity, lower memory footprint and better energy efficiency are desired. Many prior works address this need by removing redundant parameters. Parameter quantization replaces floating-point arithmetic with lower precision fixed-point arithmetic, further reducing complexity. Typical training of quantized weight neural networks starts from fully quantized weights. Quantization creates random noise. As a way to compensate for this noise, during training, we propose to quantize some weights while keeping others in floating-point precision. A deep neural network has many layers. To arrive at a fully quantized weight network, we start from one quantized layer and then quantize more and more layers. We show that the order of layer quantization affects accuracies. Order count is large for deep neural networks. A sensitivity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rakutentech/iterative_training
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI