Bounds on Agreement between Subjective and Objective Measurements
Jaden Pieper, Stephen D. Voran

TL;DR
This paper derives theoretical bounds on the correlation and error metrics between subjective and objective multimedia quality assessments, accounting for subjective test noise and vote variance, with models that match empirical data.
Contribution
It introduces a novel framework for bounding PCC and MSE in subjective tests, including a binomial-based vote model (BinoVotes) and a mean opinion score model (BinoMOS), to set realistic expectations.
Findings
Bounds align well with empirical data from 18 tests.
Vote variance information enables easy calculation of bounds.
Models provide expectations for subjective-objective agreement metrics.
Abstract
Objective estimators of multimedia quality are often judged by comparing estimates with subjective "truth data," most often via Pearson correlation coefficient (PCC) or mean-squared error (MSE). But subjective test results contain noise, so striving for a PCC of 1.0 or an MSE of 0.0 is neither realistic nor repeatable. Numerous efforts have been made to acknowledge and appropriately accommodate subjective test noise in objective-subjective comparisons, typically resulting in new analysis frameworks and figures-of-merit. We take a different approach. By making only basic assumptions, we derive bounds on PCC and MSE that can be expected for a subjective test. Consistent with intuition, these bounds are functions of subjective vote variance. When a subjective test includes vote variance information, the calculation of the bounds is easy, and in this case we say the resulting bounds are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Network Traffic and Congestion Control · Video Analysis and Summarization
