On the impact of the parametrization of deep convolutional neural networks on post-training quantization

Samy Houache (IMB); Jean Fran\c{c}ois Aujol (UB; IMB); Yann Traonmilin (IMB)

arXiv:2502.01156·cs.IT·October 13, 2025

On the impact of the parametrization of deep convolutional neural networks on post-training quantization

Samy Houache (IMB), Jean Fran\c{c}ois Aujol (UB, IMB), Yann Traonmilin (IMB)

PDF

Open Access

TL;DR

This paper develops new theoretical bounds for the output approximation error of quantized CNNs, showing significant improvements over existing results, especially for deep networks like MobileNetV2 and ResNets.

Contribution

It introduces novel approximation bounds for quantized CNNs that significantly outperform previous bounds, especially considering layerwise parametrization and depth effects.

Findings

01

Bounds improve with network depth, reducing quantization error.

02

Numerical experiments validate the theoretical bounds on MobileNetV2 and ResNets.

03

Significant order-of-magnitude gains over state-of-the-art results.

Abstract

This paper introduces novel theoretical approximation bounds for the output of quantized neural networks, with a focus on convolutional neural networks (CNN). By considering layerwise parametrization and focusing on the quantization of weights, we provide bounds that gain several orders of magnitude compared to state-of-the-art results on classical deep convolutional neural networks such as MobileNetV2 or ResNets. These gains are achieved by improving the behaviour of the approximation bounds with respect to the depth parameter, which has the most impact on the approximation error induced by quantization. To complement our theoretical result, we provide a numerical exploration of our bounds on MobileNetV2 and ResNets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI