HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami,, Michael W. Mahoney, Kurt Keutzer

TL;DR
This paper introduces HAWQV2, an advanced Hessian-based method for mixed-precision neural network quantization that improves sensitivity measurement, automates precision selection, and extends to activation quantization, leading to state-of-the-art results.
Contribution
HAWQV2 improves upon prior work by using the full Hessian spectrum for sensitivity, automating mixed-precision bit selection, and including activation quantization analysis.
Findings
Achieves state-of-the-art quantization results across tasks.
Improves sensitivity measurement with average Hessian eigenvalues.
Extends quantization to activation layers, benefiting object detection.
Abstract
Quantization is an effective method for reducing memory footprint and inference time of Neural Networks, e.g., for efficient inference in the cloud, especially at the edge. However, ultra low precision quantization could lead to significant degradation in model generalization. A promising method to address this is to perform mixed-precision quantization, where more sensitive layers are kept at higher precision. However, the search space for a mixed-precision quantization is exponential in the number of layers. Recent work has proposed HAWQ, a novel Hessian based framework, with the aim of reducing this exponential search space by using second-order information. While promising, this prior work has three major limitations: (i) HAWQV1 only uses the top Hessian eigenvalue as a measure of sensitivity and do not consider the rest of the Hessian spectrum; (ii) HAWQV1 approach only provides…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
