FP=xINT: Representing Neural Networks via Low-Bit Series Basis Functions

Boyang Zhang; Daning Cheng; Yunquan Zhang; Jiake Tian; Jing Li; Fangming Liu

arXiv:2412.06865 · cs.LG · December 24, 2025

TL;DR

This paper introduces FP=xINT, a series expansion framework for neural network quantization that approximates a full-precision model with a set of low-bit basis models, improving accuracy at extremely low bit widths.

Contribution

First application of series expansion to neural network quantization, enabling rapid, calibration-free approximation of full-precision models with theoretical convergence guarantees.

Findings

01. Achieves state-of-the-art low-bit quantization performance

02. 4-bit ResNet-50 surpasses original accuracy, reaching 77.03%

03. Ensures operation parallelism and model accuracy restoration

Abstract

Post-Training Quantization (PTQ) converts pre-trained Full-Precision (FP) models into quantized versions without training. While existing methods reduce model size and computational cost, they significantly degrade accuracy and quantization efficiency at extremely low bit widths because of quantization noise. We introduce a deep model series expansion framework to address this issue, enabling rapid and accurate approximation of unquantized models without calibration sets or fine-tuning. This is the first use of series expansion for neural network quantization. Specifically, our method expands the FP model into multiple low-bit basis models. To ensure accurate quantization, we develop low-bit basis model expansions at different granularities (tensor, layer, model), and show theoretically that they converge to the dense model, thus restoring FP model accuracy. Additionally, we design…
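The abstract does not spell out how the expansion is built, so the following is only a minimal sketch of the tensor-level idea under stated assumptions: a symmetric uniform quantizer, a per-term scale taken from the residual's peak magnitude, and a residual series in which each term quantizes what the previous terms left over. The function names (`quantize`, `expand`, `reconstruct`) and the weight values are hypothetical and are not taken from the paper.

```python
# Illustrative only: residual series expansion of a weight vector into
# low-bit integer terms. The quantizer and the scale rule are assumptions,
# not the paper's actual construction.

def quantize(values, bits):
    """Symmetric uniform quantization: returns (scale, integer codes)."""
    qmax = 2 ** (bits - 1) - 1            # 7 for 4-bit signed codes
    peak = max(abs(v) for v in values)
    if peak == 0.0:
        return 1.0, [0] * len(values)
    scale = peak / qmax
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return scale, codes


def expand(values, bits, terms):
    """Quantize the running residual `terms` times; return the series."""
    residual = list(values)
    series = []
    for _ in range(terms):
        scale, codes = quantize(residual, bits)
        series.append((scale, codes))
        # What this term failed to capture becomes the next term's input.
        residual = [r - scale * c for r, c in zip(residual, codes)]
    return series


def reconstruct(series, length):
    """Sum the scaled integer terms back into a float vector."""
    out = [0.0] * length
    for scale, codes in series:
        out = [o + scale * c for o, c in zip(out, codes)]
    return out


def max_error(a, b):
    """Largest absolute element-wise difference between two vectors."""
    return max(abs(x - y) for x, y in zip(a, b))


weights = [0.8131, -0.2924, 0.0517, -0.9970, 0.4402, 0.1268]  # made-up values
errors = []
for k in (1, 2, 3):
    approx = reconstruct(expand(weights, bits=4, terms=k), len(weights))
    errors.append(max_error(weights, approx))
print(errors)
```

In this sketch each term leaves a residual of at most half a quantization step, so with 4-bit codes the worst-case error shrinks by at least a factor of 14 per added term. That is the sense in which a sum of low-bit terms can converge to the full-precision values; the paper's layer-level and model-level expansions are not covered here.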

Taxonomy

Topics: Advanced Data Compression Techniques · Image and Signal Denoising Methods · Digital Filter Design and Implementation