Iterative Training: Finding Binary Weight Deep Neural Networks with Layer Binarization
Cheng-Chou Lan

TL;DR
This paper introduces an iterative training method for binary weight neural networks, emphasizing layer-wise binarization order and sensitivity pre-training to improve accuracy in deep networks.
Contribution
It proposes a novel iterative training approach with guided layer binarization and sensitivity pre-training to enhance binary neural network performance.
Findings
Layer binarization order affects accuracy.
Starting from partial binary weights improves results.
Guided binarization further enhances accuracy.
Abstract
In low-latency or mobile applications, lower computation complexity, lower memory footprint and better energy efficiency are desired. Many prior works address this need by removing redundant parameters. Parameter quantization replaces floating-point arithmetic with lower precision fixed-point arithmetic, further reducing complexity. Typical training of quantized weight neural networks starts from fully quantized weights. Quantization creates random noise. As a way to compensate for this noise, during training, we propose to quantize some weights while keeping others in floating-point precision. A deep neural network has many layers. To arrive at a fully quantized weight network, we start from one quantized layer and then quantize more and more layers. We show that the order of layer quantization affects accuracies. Order count is large for deep neural networks. A sensitivity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
