Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization
Stone Yun, Alexander Wong

TL;DR
This paper investigates how different weight initializations affect neural network quantization robustness and introduces a novel method using Graph Hypernetworks to improve quantized model accuracy, especially at low bit-widths.
Contribution
It provides the first comprehensive analysis of weight initialization effects on quantization robustness and proposes GHN-QAT, a new initialization method that enhances quantized neural network performance.
Findings
Weight initialization significantly impacts quantization robustness across CNN architectures.
GHN-QAT improves quantized accuracy, notably at 4-bit and 2-bit levels.
GHN-QAT outperforms random initialization and standard methods in quantized CNNs.
Abstract
Deep neural network (DNN) quantization for fast, efficient inference has been an important tool in limiting the cost of machine learning (ML) model inference. Quantization-specific model development techniques such as regularization, quantization-aware training, and quantization-robustness penalties have served to greatly boost the accuracy and robustness of modern DNNs. However, very little exploration has been done on improving the initial conditions of DNN training for quantization. Just as random weight initialization has been shown to significantly impact test accuracy of floating point models, it would make sense that different weight initialization methods impact quantization robustness of trained models. We present an extensive study examining the effects of different weight initializations on a variety of CNN building blocks commonly used in efficient CNNs. This analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)
