Label Convergence: Defining an Upper Performance Bound in Object   Recognition through Contradictory Annotations

David Tschirschwitz; Volker Rodehorst

arXiv:2409.09412·cs.CV·January 23, 2025

Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations

David Tschirschwitz, Volker Rodehorst

PDF

Open Access 1 Repo

TL;DR

This paper introduces the concept of label convergence to define an upper bound on object recognition performance considering annotation contradictions, highlighting the impact of label noise on model evaluation and suggesting directions for future data quality improvements.

Contribution

It formalizes the notion of label convergence as an upper bound on model accuracy in the presence of contradictory annotations, supported by analysis of real-world datasets including LVIS.

Findings

01

Label convergence for LVIS is approximately 62.63-67.52 mAP.

02

State-of-the-art models reach the upper end of the label convergence interval.

03

Annotation errors significantly influence achievable performance bounds.

Abstract

Annotation errors are a challenge not only during training of machine learning models, but also during their evaluation. Label variations and inaccuracies in datasets often manifest as contradictory examples that deviate from established labeling conventions. Such inconsistencies, when significant, prevent models from achieving optimal performance on metrics such as mean Average Precision (mAP). We introduce the notion of "label convergence" to describe the highest achievable performance under the constraint of contradictory test annotations, essentially defining an upper bound on model accuracy. Recognizing that noise is an inherent characteristic of all data, our study analyzes five real-world datasets, including the LVIS dataset, to investigate the phenomenon of label convergence. We approximate that label convergence is between 62.63-67.52 mAP@[0.5:0.95:0.05] for LVIS with 95%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Madave94/multi-annotator-toolkit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Data Classification · Handwritten Text Recognition Techniques

MethodsFocus