EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
Peijie Dong, Lujun Li, Zimian Wei, Xin Niu, Zhiliang Tian, and Hengyue Pan

TL;DR
This paper introduces EMQ, an automated framework that evolves training-free proxies for mixed-precision quantization, significantly improving search efficiency and accuracy without heavy tuning or expert knowledge.
Contribution
The paper presents an automatic proxy search framework using evolving algorithms to discover highly correlated proxies for mixed-precision quantization.
Findings
EMQ outperforms state-of-the-art methods on ImageNet.
The evolved proxies achieve better correlation with quantization accuracy.
The framework reduces search cost and complexity.
Abstract
Mixed-Precision Quantization~(MQ) can achieve a competitive accuracy-complexity trade-off for models. Conventional training-based search methods require time-consuming candidate training to search optimized per-layer bit-width configurations in MQ. Recently, some training-free approaches have presented various MQ proxies and significantly improve search efficiency. However, the correlation between these proxies and quantization accuracy is poorly understood. To address the gap, we first build the MQ-Bench-101, which involves different bit configurations and quantization results. Then, we observe that the existing training-free proxies perform weak correlations on the MQ-Bench-101. To efficiently seek superior proxies, we develop an automatic search of proxies framework for MQ via evolving algorithms. In particular, we devise an elaborate search space involving the existing proxies and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization· youtube
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications · Medical Image Segmentation Techniques
MethodsAverage Pooling · Batch Normalization · 1x1 Convolution · Max Pooling · Residual Connection · Residual Block · Kaiming Initialization · Global Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block
