A Rate-Quality Model for Learned Video Coding
Sang NguyenQuang, Cheng-Wei Chen, Xiem HoangVan, Wen-Hsiao Peng

TL;DR
This paper introduces RQNet, a neural network-based model that accurately predicts the rate-quality relationship in learned video coding, enabling real-time adaptation and improved coding efficiency.
Contribution
The paper proposes a neural network model for the rate-quality relationship in learned video coding, allowing on-the-fly parameter estimation for better accuracy and adaptability.
Findings
Achieves significantly smaller bitrate deviations than baseline methods.
Enables online adaptation of the R-Q model for improved flexibility.
Maintains minimal additional complexity while enhancing prediction accuracy.
Abstract
Learned video coding (LVC) has recently achieved superior coding performance. In this paper, we model the rate-quality (R-Q) relationship for learned video coding by a parametric function. We learn a neural network, termed RQNet, to characterize the relationship between the bitrate and quality level according to video content and coding context. The predicted (R,Q) results are further integrated with those from previously coded frames using the least-squares method to determine the parameters of our R-Q model on-the-fly. Compared to the conventional approaches, our method accurately estimates the R-Q relationship, enabling the online adaptation of model parameters to enhance both flexibility and precision. Experimental results show that our R-Q model achieves significantly smaller bitrate deviations than the baseline method on commonly used datasets with minimal additional complexity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Image and Video Quality Assessment
