On Quantizing Neural Representation for Variable-Rate Video Coding

Junqi Shi; Zhujia Chen; Hanfei Li; Qi Zhao; Ming Lu; Tong Chen; Zhan; Ma

arXiv:2502.11729·eess.IV·February 18, 2025·ICLR

On Quantizing Neural Representation for Variable-Rate Video Coding

Junqi Shi, Zhujia Chen, Hanfei Li, Qi Zhao, Ming Lu, Tong Chen, Zhan, Ma

PDF

Open Access 1 Repo 3 Reviews

TL;DR

NeuroQuant is a novel post-training quantization method for variable-rate neural video coding that adjusts pre-trained weights' quantization parameters to achieve efficient compression and fast encoding without retraining.

Contribution

This paper introduces the first variable-rate INR-VC method with a theoretical framework for sensitivity-based quantization and novel calibration strategies, improving compression and speed.

Findings

01

Outperforms existing quantization techniques in variable bitwidths

02

Enables encoding acceleration up to eight times

03

Achieves minimal loss with INT2 quantization

Abstract

This work introduces NeuroQuant, a novel post-training quantization (PTQ) approach tailored to non-generalized Implicit Neural Representations for variable-rate Video Coding (INR-VC). Unlike existing methods that require extensive weight retraining for each target bitrate, we hypothesize that variable-rate coding can be achieved by adjusting quantization parameters (QPs) of pre-trained weights. Our study reveals that traditional quantization methods, which assume inter-layer independence, are ineffective for non-generalized INR-VC models due to significant dependencies across layers. To address this, we redefine variable-rate INR-VC as a mixed-precision quantization problem and establish a theoretical framework for sensitivity criteria aimed at simplified, fine-grained rate control. Additionally, we propose network-wise calibration and channel-wise quantization strategies to minimize…

Peer Reviews

Decision·ICLR 2025 Spotlight

Reviewer 01Rating 8Confidence 4

Strengths

1) Using one single model for different bit rates with post-training quantization is interesting. This alleviates the need to train a model for each bit-rate, this will decrease the training time. 2) The paper provides the mathematical insights to their proposed method, inspired from the Nagel et. al (2020), and formulates the post-training quantization objective with respect to the network calibration. 3) The experimental results show that the proposed method has a significant gain in the va

Weaknesses

1. The authors failed to compare their proposed approach with Neural Network Coding tool (NNC) [1] which also performs post-training quantization, and also can offer variable-bitrate coding by adjusting QP parameters. The authors should compare their method with NNC. [1] S. Wiedemann et al., "DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks," in IEEE Journal of Selected Topics in Signal Processing, vol. 14, no. 4, pp. 700-714, May 2020 https://arxiv.o

Reviewer 02Rating 6Confidence 3

Strengths

1. The proposed method achieves variable-rate coding by adjusting QPs of pre-trained weights, eliminating the need for repeated model training for each target rate, which significantly reduces encoding time and complexity. 2. The method demonstrates superior performance in compression efficiency, outperforming competitors and enabling quantization down to INT2 without notable performance degradation. 3. The paper proposes a unified formula for representation-oriented PTQ calibration, streamlin

Weaknesses

N/A Actually, I am not very familiar with this field, so please have AE consider the opinions of other reviewers more.

Reviewer 03Rating 6Confidence 4

Strengths

- The results look promising. Although NeuroQuant achieves only a marginal improvement over the current best INR-VC (-4.8%), it provides greater efficiency in obtaining multiple rate points. - The experiments comparing different quantization methods are comprehensive, which will be helpful for future work in this area.

Weaknesses

- In Table 2, excluding the pretraining time for NeuroQuant does not seem appropriate. Even with NeuroQuant, pretraining is still required, and the current presentation may be misleading. The authors should consider reporting the pretraining and fine-tuning times separately for both the baseline models and NeuroQuant. - Similarly, the claim of an 8x encoding speedup is also misleading, as it excludes the pretraining time required for INR-VC encoding (even though NeuroQuant avoids full retraining

Code & Models

Repositories

eric-qi/neuroquant
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Image Processing Techniques