A Framework to Map VMAF with the Probability of Just Noticeable Difference between Video Encoding Recipes
Jingwen Zhu, Suiyi Ling, Yoann Baveye, Patrick Le Callet

TL;DR
This paper introduces a novel, efficient framework that maps VMAF score differences to the probability of JND between videos encoded with different recipes, independent of codecs, reducing the need for costly subjective testing.
Contribution
The proposed model decouples encoding recipes from JND estimation, using objective VQA scores to predict perceptual differences, enabling codec-agnostic and computationally efficient JND estimation.
Findings
Model accurately estimates JND probabilities across various codecs.
Framework reduces reliance on expensive subjective JND datasets.
Demonstrates efficiency and effectiveness through extensive experiments.
Abstract
Just Noticeable Difference (JND) model developed based on Human Vision System (HVS) through subjective studies is valuable for many multimedia use cases. In the streaming industries, it is commonly applied to reach a good balance between compression efficiency and perceptual quality when selecting video encoding recipes. Nevertheless, recent state-of-the-art deep learning based JND prediction model relies on large-scale JND ground truth that is expensive and time consuming to collect. Most of the existing JND datasets contain limited number of contents and are limited to a certain codec (e.g., H264). As a result, JND prediction models that were trained on such datasets are normally not agnostic to the codecs. To this end, in order to decouple encoding recipes and JND estimation, we propose a novel framework to map the difference of objective Video Quality Assessment (VQA) scores, i.e.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Video Coding and Compression Technologies
