SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks
Sungho Shin, Yoonho Boo, Wonyong Sung

TL;DR
This paper introduces SQWA, a novel stochastic weight averaging method for low-precision deep neural networks, which improves their generalization by averaging multiple quantized models during training.
Contribution
The paper proposes a new optimization approach, SQWA, that enhances low-precision DNNs' generalization by combining model averaging with quantization and visualization techniques.
Findings
Achieved state-of-the-art results for 2-bit QDNNs on CIFAR-100 and ImageNet.
Quantized DNNs with SQWA are located near flat minima, indicating better generalization.
Performance exceeds previous non-uniform quantization methods despite using uniform quantization.
Abstract
Designing a deep neural network (DNN) with good generalization capability is a complex process especially when the weights are severely quantized. Model averaging is a promising approach for achieving the good generalization capability of DNNs, especially when the loss surface for training contains many sharp minima. We present a new quantized neural network optimization approach, stochastic quantized weight averaging (SQWA), to design low-precision DNNs with good generalization capability using model averaging. The proposed approach includes (1) floating-point model training, (2) direct quantization of weights, (3) capturing multiple low-precision models during retraining with cyclical learning rates, (4) averaging the captured models, and (5) re-quantizing the averaged model and fine-tuning it with low-learning rates. Additionally, we present a loss-visualization technique on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
