ProxQuant: Quantized Neural Networks via Proximal Operators

Yu Bai; Yu-Xiang Wang; Edo Liberty

arXiv:1810.00861·cs.LG·March 6, 2019·35 cites

ProxQuant: Quantized Neural Networks via Proximal Operators

Yu Bai, Yu-Xiang Wang, Edo Liberty

PDF

Open Access 1 Repo

TL;DR

ProxQuant introduces a principled method for training quantized neural networks using proximal operators, outperforming traditional straight-through gradient methods in stability and effectiveness, especially for binary quantization.

Contribution

It formulates quantized network training as a regularized optimization problem and applies the prox-gradient method, providing a more stable and theoretically grounded alternative to straight-through gradients.

Findings

01

ProxQuant outperforms state-of-the-art on binary quantization of ResNets and LSTMs.

02

ProxQuant achieves comparable results to state-of-the-art on multi-bit quantization.

03

The method is more stable than traditional straight-through gradient approaches.

Abstract

To make deep neural networks feasible in resource-constrained environments (such as mobile devices), it is beneficial to quantize models by using low-precision weights. One common technique for quantizing neural networks is the straight-through gradient method, which enables back-propagation through the quantization mapping. Despite its empirical success, little is understood about why the straight-through gradient method works. Building upon a novel observation that the straight-through gradient method is in fact identical to the well-known Nesterov's dual-averaging algorithm on a quantization constrained optimization problem, we propose a more principled alternative approach, called ProxQuant, that formulates quantized network training as a regularized learning problem instead and optimizes it via the prox-gradient method. ProxQuant does back-propagation on the underlying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenbai01/ProxQuant
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques