FP=xINT:Representing Neural Networks via Low-Bit Series Basis Functions
Boyang Zhang, Daning Cheng, Yunquan Zhang, Jiake Tian, Jing Li, Fangming Liu

TL;DR
This paper introduces FP=xINT, a novel series expansion framework for neural network quantization that accurately approximates full-precision models using low-bit basis models, improving performance at extremely low bit settings.
Contribution
First application of series expansion to neural network quantization, enabling rapid, calibration-free approximation of full-precision models with theoretical convergence guarantees.
Findings
Achieves state-of-the-art low-bit quantization performance
4-bit ResNet-50 surpasses original accuracy, reaching 77.03%
Ensures operation parallelism and model accuracy restoration
Abstract
Post-Training Quantization (PTQ) converts pre-trained Full-Precision (FP) models into quantized versions without training. While existing methods reduce size and computational costs, they also significantly degrade performance and quantization efficiency at extremely low settings due to quantization noise. We introduce a deep model series expansion framework to address this issue, enabling rapid and accurate approximation of unquantized models without calibration sets or fine-tuning. This is the first use of series expansion for neural network quantization. Specifically, our method expands the FP model into multiple low-bit basis models. To ensure accurate quantization, we develop low-bit basis model expansions at different granularities (tensor, layer, model), and theoretically confirm their convergence to the dense model, thus restoring FP model accuracy. Additionally, we design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Image and Signal Denoising Methods · Digital Filter Design and Implementation
