Beyond Correlation: Evaluating Multimedia Quality Models with the   Constrained Concordance Index

Alessandro Ragano; Helard Becerra Martinez; Andrew Hines

arXiv:2411.05794·cs.MM·November 12, 2024

Beyond Correlation: Evaluating Multimedia Quality Models with the Constrained Concordance Index

Alessandro Ragano, Helard Becerra Martinez, Andrew Hines

PDF

Open Access

TL;DR

This paper introduces the Constrained Concordance Index (CCI), a new metric for evaluating multimedia quality models that accounts for subjective rating uncertainties and statistical significance, improving evaluation accuracy.

Contribution

The paper proposes the CCI metric, addressing limitations of traditional correlation measures by incorporating significance testing and confidence interval considerations.

Findings

01

CCI outperforms traditional metrics in low sample scenarios

02

Incorporating rater variability improves evaluation robustness

03

Focusing on significant MOS differences enhances model assessment

Abstract

This study investigates the evaluation of multimedia quality models, focusing on the inherent uncertainties in subjective Mean Opinion Score (MOS) ratings due to factors like rater inconsistency and bias. Traditional statistical measures such as Pearson's Correlation Coefficient (PCC), Spearman's Rank Correlation Coefficient (SRCC), and Kendall's Tau (KTAU) often fail to account for these uncertainties, leading to inaccuracies in model performance assessment. We introduce the Constrained Concordance Index (CCI), a novel metric designed to overcome the limitations of existing metrics by considering the statistical significance of MOS differences and excluding comparisons where MOS confidence intervals overlap. Through comprehensive experiments across various domains including speech and image quality assessment, we demonstrate that CCI provides a more robust and accurate evaluation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment