Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling

Jinhee Kim; Jae Jun An; Kang Eun Jeon; Jong Hwan Ko

arXiv:2510.20673·cs.CV·October 24, 2025

Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling

Jinhee Kim, Jae Jun An, Kang Eun Jeon, Jong Hwan Ko

PDF

Open Access

TL;DR

This paper introduces techniques to efficiently train multi-bit quantization networks by correcting weight biases and sampling informative data subsets, significantly reducing training time while maintaining high accuracy.

Contribution

The authors propose weight bias correction and bit-wise coreset sampling methods to cut training overhead in multi-bit quantization networks without sacrificing performance.

Findings

01

Achieves up to 7.88x reduction in training time.

02

Maintains or improves accuracy across multiple datasets and architectures.

03

Eliminates the need for fine-tuning for different bit-widths.

Abstract

Multi-bit quantization networks enable flexible deployment of deep neural networks by supporting multiple precision levels within a single model. However, existing approaches suffer from significant training overhead as full-dataset updates are repeated for each supported bit-width, resulting in a cost that scales linearly with the number of precisions. Additionally, extra fine-tuning stages are often required to support additional or intermediate precision options, further compounding the overall training burden. To address this issue, we propose two techniques that greatly reduce the training overhead without compromising model utility: (i) Weight bias correction enables shared batch normalization and eliminates the need for fine-tuning by neutralizing quantization-induced bias across bit-widths and aligning activation distributions; and (ii) Bit-wise coreset sampling strategy allows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning