Oscillations Make Neural Networks Robust to Quantization

Jonathan Wensh{\o}j; Bob Pepin; Raghavendra Selvan

arXiv:2502.00490·cs.LG·December 10, 2025

Oscillations Make Neural Networks Robust to Quantization

Jonathan Wensh{\o}j, Bob Pepin, Raghavendra Selvan

PDF

Open Access

TL;DR

This paper reveals that weight oscillations during Quantization Aware Training are crucial for model robustness, and introduces a regularizer to induce such oscillations, improving post-training quantization performance.

Contribution

It provides a theoretical analysis of oscillations in QAT, introduces a regularizer to induce oscillations, and demonstrates improved quantization results on neural networks.

Findings

01

Oscillations are essential for effective QAT.

02

Training with induced oscillations matches QAT performance.

03

Regularizer improves post-training quantization results.

Abstract

We challenge the prevailing view that weight oscillations observed during Quantization Aware Training (QAT) are merely undesirable side-effects and argue instead that they are an essential part of QAT. We show in a univariate linear model that QAT results in an additional loss term that causes oscillations by pushing weights away from their nearest quantization level. Based on the mechanism from the analysis, we then derive a regularizer that induces oscillations in the weights of neural networks during training. Our empirical results on ResNet-18 and Tiny Vision Transformer, evaluated on CIFAR-10 and Tiny ImageNet datasets, demonstrate across a range of quantization levels that training with oscillations followed by post-training quantization (PTQ) is sufficient to recover the performance of QAT in most cases. With this work we provide further insight into the dynamics of QAT and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications