Rate-Distortion Optimization for Ensembles of Non-Reference Metrics
Xin Xiong, Samuel Fern\'andez-Mendui\~na, Eduardo Pavez, Antonio Ortega, Neil Birkbeck, Balu Adsumilli

TL;DR
This paper introduces a robust rate-distortion optimization method that combines multiple non-reference image quality metrics into an ensemble, stabilizing their gradients to improve video encoding efficiency without increasing decoder complexity.
Contribution
It extends linearization of non-reference metrics to ensembles and incorporates smoothing to enhance robustness, reducing complexity and improving quality in hybrid video codecs.
Findings
Achieves consistent bitrate savings across multiple NRMs.
Reduces encoding runtime significantly compared to direct NRM optimization.
No additional decoder complexity introduced.
Abstract
Non-reference metrics (NRMs) can assess the visual quality of images and videos without a reference, making them well-suited for the evaluation of user-generated content. Nonetheless, rate-distortion optimization (RDO) in video coding is still mainly driven by full-reference metrics, such as the sum of squared errors, which treat the input as an ideal target. A way to incorporate NRMs into RDO is through linearization (LNRM), where the gradient of the NRM with respect to the input guides bit allocation. While this strategy improves the quality predicted by some metrics, we show that it can yield limited gains or degradations when evaluated with other NRMs. We argue that NRMs are highly non-linear predictors with locally unstable gradients that can compromise the quality of the linearization; furthermore, optimizing a single metric may exploit model-specific biases that do not generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Visual Attention and Saliency Detection
