K$\alpha$LOS finds Consensus: A Meta-Algorithm for Evaluating Inter-Annotator Agreement in Complex Vision Tasks
David Tschirschwitz, Volker Rodehorst

TL;DR
KαLOS is a unified, data-driven meta-algorithm that standardizes dataset quality evaluation in complex vision tasks by resolving spatial correspondence and calibrating agreement metrics, improving reliability and diagnostics.
Contribution
It introduces KαLOS, a principled framework that generalizes agreement evaluation across diverse vision tasks, addressing the limitations of existing heuristics and validation methods.
Findings
KαLOS effectively calibrates localization parameters to inherent agreement distributions.
The framework enables granular diagnostics like annotator vitality and localization sensitivity.
Empirical validation with a novel noise generator demonstrates robustness in distinguishing signal from noise.
Abstract
Progress in object detection benchmarks is stagnating. It is limited not by architectures but by the inability to distinguish model improvements from label noise. To restore trust in benchmarking the field requires rigorous quantification of annotation consistency to ensure the reliability of evaluation data. However, standard statistical metrics fail to handle the instance correspondence problem inherent to vision tasks. Furthermore, validating new agreement metrics remains circular because no objective ground truth for agreement exists. This forces reliance on unverifiable heuristics. We propose KLOS (KALOS), a unified meta-algorithm that generalizes the "Localization First" principle to standardize dataset quality evaluation. By resolving spatial correspondence before assessing agreement, our framework transforms complex spatio-categorical problems into nominal reliability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
