Efficiently Training A Flat Neural Network Before It has been Quantizated

Peng Xia; Junbiao Pang; Tianyang Cai

arXiv:2511.01462·cs.CV·November 4, 2025

Efficiently Training A Flat Neural Network Before It has been Quantizated

Peng Xia, Junbiao Pang, Tianyang Cai

PDF

Open Access

TL;DR

This paper introduces a framework for pre-training neural networks to be more suitable for low-bit quantization, reducing quantization errors and improving post-training quantization efficiency.

Contribution

It proposes a novel approach to pre-condition neural networks by modeling quantization errors as Gaussian noises and optimizing for flat minima, enhancing low-bit PTQ performance.

Findings

01

Model-agnostic neural networks can be pre-trained for low-bit quantization.

02

Gaussian noise modeling improves quantization error understanding.

03

Experimental results show significant reduction in quantization errors.

Abstract

Post-training quantization (PTQ) for vision transformers (ViTs) has garnered significant attention due to its efficiency in compressing models. However, existing methods typically overlook the relationship between a well-trained NN and the quantized model, leading to considerable quantization error for PTQ. However, it is unclear how to efficiently train a model-agnostic neural network which is tailored for a predefined precision low-bit model. In this paper, we firstly discover that a flat full precision neural network is crucial for low-bit quantization. To achieve this, we propose a framework that proactively pre-conditions the model by measuring and disentangling the error sources. Specifically, both the Activation Quantization Error (AQE) and the Weight Quantization Error (WQE) are statistically modeled as independent Gaussian noises. We study several noise injection optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Infrared Target Detection Methodologies