Rho-Perfect: Correlation Ceiling For Subjective Evaluation Datasets

Fredrik Cumlin

arXiv:2602.08552·cs.LG·February 10, 2026


TL;DR

This paper introduces $ρ$-Perfect, a method to estimate the maximum possible correlation between models and human ratings in subjective datasets, accounting for inherent noise and data reliability.

Contribution

It provides a practical estimation technique for the correlation ceiling in subjective datasets, addressing the impact of heteroscedastic noise on model-human correlation.

Findings

01

$ρ$-Perfect accurately estimates the correlation ceiling in subjective datasets.

02

The method distinguishes between model limitations and data quality issues.

03

Application to speech quality data demonstrates its practical utility.

Abstract

Subjective ratings contain inherent noise that limits the model-human correlation, but this reliability issue is rarely quantified. In this paper, we present $ρ$-Perfect, a practical estimation of the highest achievable correlation of a model on subjectively rated datasets. We define $ρ$-Perfect to be the correlation between a perfect predictor and human ratings, and derive an estimate of the value based on heteroscedastic noise scenarios, a common occurrence in subjectively rated datasets. We show that $ρ$-Perfect squared estimates test-retest correlation and use this to validate the estimate. We demonstrate the use of $ρ$-Perfect on a speech quality dataset and show how the measure can distinguish between model limitations and data quality issues.
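The claim that $ρ$-Perfect squared estimates test-retest correlation can be illustrated with a small simulation. The sketch below is not the paper's estimator. It assumes a hypothetical setup in which each item has a known true quality, each item has its own rating-noise standard deviation (heteroscedastic noise), and the human rating is the mean over a fixed number of raters. The function name `simulate_ceiling` and all parameter values are illustrative assumptions.

```python
import numpy as np


def simulate_ceiling(n_items=5000, n_raters=8, seed=0):
    """Simulate a perfect predictor against noisy mean ratings.

    Returns (rho_perfect, test_retest), where rho_perfect is the Pearson
    correlation between the true quality and one set of mean ratings, and
    test_retest is the correlation between two independent sets of mean
    ratings of the same items. All distributions are assumptions chosen
    for illustration only.
    """
    rng = np.random.default_rng(seed)

    # Hypothetical true quality per item (what a perfect predictor outputs).
    true_quality = rng.normal(loc=3.0, scale=1.0, size=n_items)

    # Heteroscedastic noise: each item gets its own rating-noise std.
    noise_sd = rng.uniform(0.3, 1.5, size=n_items)

    def mean_rating():
        # Draw n_raters noisy ratings per item and average them.
        noise = rng.normal(size=(n_items, n_raters)) * noise_sd[:, None]
        return (true_quality[:, None] + noise).mean(axis=1)

    test = mean_rating()
    retest = mean_rating()

    rho_perfect = np.corrcoef(true_quality, test)[0, 1]
    test_retest = np.corrcoef(test, retest)[0, 1]
    return rho_perfect, test_retest


if __name__ == "__main__":
    rho, tr = simulate_ceiling()
    print(f"rho_perfect         = {rho:.3f}")
    print(f"rho_perfect squared = {rho**2:.3f}")
    print(f"test-retest         = {tr:.3f}")
```

Under these assumptions the squared correlation of the perfect predictor and the test-retest correlation should come out close to each other, and both fall as the rating noise grows or the number of raters shrinks. A model whose correlation with the ratings is already near this ceiling is limited by the data rather than by the model.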


Taxonomy

Topics: Speech and Audio Processing · Speech Recognition and Synthesis · Face recognition and analysis