Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion
Ryutaro Tanno, Ardavan Saeedi, Swami Sankaranarayanan, Daniel C., Alexander, Nathan Silberman

TL;DR
This paper introduces a regularized estimation method for learning true labels from noisy, multi-annotator data, effectively modeling individual annotator confusion and improving classification accuracy in noisy label scenarios.
Contribution
The paper proposes a novel regularization-based approach to jointly estimate annotator confusion matrices and true labels, enhancing learning from noisy annotations.
Findings
Outperforms or matches state-of-the-art methods on image classification tasks.
Capable of estimating annotator skills with only one label per image.
Effective in both simulated and real noisy label settings.
Abstract
The predictive performance of supervised learning algorithms depends on the quality of labels. In a typical label collection process, multiple annotators provide subjective noisy estimates of the "truth" under the influence of their varying skill-levels and biases. Blindly treating these noisy labels as the ground truth limits the accuracy of learning algorithms in the presence of strong disagreement. This problem is critical for applications in domains such as medical imaging where both the annotation cost and inter-observer variability are high. In this work, we present a method for simultaneously learning the individual annotator model and the underlying true label distribution, using only noisy observations. Each annotator is modeled by a confusion matrix that is jointly estimated along with the classifier predictions. We propose to add a regularization term to the loss function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms
