Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen, Shipeng Bai, Tianxin Huang, Mengmeng Wang, Guanzhong Tian,, Yong Liu

TL;DR
This paper introduces a data-free mixed-precision compensation method that restores the accuracy of ultra-low precision neural network models without requiring data or fine-tuning, using a mathematically derived reconstruction approach.
Contribution
It proposes a novel data-free approach for ultra-low precision quantization that leverages layer-wise mixed-precision reconstruction to improve accuracy without data or fine-tuning.
Findings
Achieves higher accuracy than recent data-free quantization methods.
Does not require original or synthetic data for model recovery.
Provides a mathematically derived closed-form solution for reconstruction.
Abstract
Neural network quantization is a very promising solution in the field of model compression, but its resulting accuracy highly depends on a training/fine-tuning process and requires the original data. This not only brings heavy computation and time costs but also is not conducive to privacy and sensitive information protection. Therefore, a few recent works are starting to focus on data-free quantization. However, data-free quantization does not perform well while dealing with ultra-low precision quantization. Although researchers utilize generative methods of synthetic data to address this problem partially, data synthesis needs to take a lot of computation and time. In this paper, we propose a data-free mixed-precision compensation (DF-MPC) method to recover the performance of an ultra-low precision quantized model without any data and fine-tuning process. By assuming the quantized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismic Imaging and Inversion Techniques · Image and Signal Denoising Methods · Advanced Image Processing Techniques
MethodsFocus
