Describing Subjective Experiment Consistency by $p$-Value P-P Plot
Jakub Nawa{\l}a (1), Lucjan Janowski (1), Bogdan \'Cmiel (2),, Krzysztof Rusek (1) ((1) AGH University of Science, Technology, Department, of Telecommunications, (2) AGH University of Science, Technology,, Department of Mathematical Analysis, Computational Mathematics and

TL;DR
This paper introduces a statistical tool using $p$-value P-P plots based on the Generalized Score Distribution to classify the consistency of subjective experiment results, ensuring more reliable conclusions in multimedia quality assessment.
Contribution
The paper presents a novel method combining GSD and bootstrap G-test to visualize and classify subjective experiment consistency, including identifying irregular score distributions.
Findings
The tool effectively classifies experiment results as consistent or inconsistent.
It successfully identifies stimuli with irregular score distributions.
The approach aligns with expectations from real-life multimedia experiments.
Abstract
There are phenomena that cannot be measured without subjective testing. However, subjective testing is a complex issue with many influencing factors. These interplay to yield either precise or incorrect results. Researchers require a tool to classify results of subjective experiment as either consistent or inconsistent. This is necessary in order to decide whether to treat the gathered scores as quality ground truth data. Knowing if subjective scores can be trusted is key to drawing valid conclusions and building functional tools based on those scores (e.g., algorithms assessing the perceived quality of multimedia materials). We provide a tool to classify subjective experiment (and all its results) as either consistent or inconsistent. Additionally, the tool identifies stimuli having irregular score distribution. The approach is based on treating subjective scores as a random variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
