PowerQuant: Automorphism Search for Non-Uniform Quantization
Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

TL;DR
PowerQuant introduces a data-free, non-uniform quantization method for deep neural networks by optimizing power functions, enabling improved accuracy without hardware changes or significant overhead.
Contribution
The paper proposes PowerQuant, a novel non-uniform quantization technique based on automorphism search, which enhances DNN compression without hardware modifications or data dependence.
Findings
Outperforms existing quantization methods across various configurations.
Requires only minor modifications to activation functions.
Achieves significant accuracy improvements with negligible overhead.
Abstract
Deep neural networks (DNNs) are nowadays ubiquitous in many domains such as computer vision. However, due to their high latency, the deployment of DNNs hinges on the development of compression techniques such as quantization which consists in lowering the number of bits used to encode the weights and activations. Growing concerns for privacy and security have motivated the development of data-free techniques, at the expanse of accuracy. In this paper, we identity the uniformity of the quantization operator as a limitation of existing approaches, and propose a data-free non-uniform method. More specifically, we argue that to be readily usable without dedicated hardware and implementation, non-uniform quantization shall not change the nature of the mathematical operations performed by the DNN. This leads to search among the continuous automorphisms of , which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMedical Imaging Techniques and Applications · Sparse and Compressive Sensing Techniques · Advanced Neural Network Applications
