ProxQuant: Quantized Neural Networks via Proximal Operators
Yu Bai, Yu-Xiang Wang, Edo Liberty

TL;DR
ProxQuant introduces a principled method for training quantized neural networks using proximal operators, outperforming traditional straight-through gradient methods in stability and effectiveness, especially for binary quantization.
Contribution
It formulates quantized network training as a regularized optimization problem and applies the prox-gradient method, providing a more stable and theoretically grounded alternative to straight-through gradients.
Findings
ProxQuant outperforms state-of-the-art on binary quantization of ResNets and LSTMs.
ProxQuant achieves comparable results to state-of-the-art on multi-bit quantization.
The method is more stable than traditional straight-through gradient approaches.
Abstract
To make deep neural networks feasible in resource-constrained environments (such as mobile devices), it is beneficial to quantize models by using low-precision weights. One common technique for quantizing neural networks is the straight-through gradient method, which enables back-propagation through the quantization mapping. Despite its empirical success, little is understood about why the straight-through gradient method works. Building upon a novel observation that the straight-through gradient method is in fact identical to the well-known Nesterov's dual-averaging algorithm on a quantization constrained optimization problem, we propose a more principled alternative approach, called ProxQuant, that formulates quantized network training as a regularized learning problem instead and optimizes it via the prox-gradient method. ProxQuant does back-propagation on the underlying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques
