Label Noise-Robust Learning using a Confidence-Based Sieving Strategy
Reihaneh Torkzadehmahani, Reza Nasirigerdeh, Daniel Rueckert, Georgios, Kaissis

TL;DR
This paper introduces CONFES, a confidence-based sieving strategy that effectively distinguishes clean from noisy labels, enhancing model robustness in noisy label learning scenarios, with theoretical guarantees and empirical validation.
Contribution
The paper proposes a novel confidence error metric and CONFES strategy for noise-robust learning, with theoretical error bounds and compatibility with existing methods.
Findings
CONFES outperforms recent methods on synthetic and real-world noise datasets.
The confidence error metric provides reliable noise detection with theoretical guarantees.
Combining CONFES with other approaches further improves robustness.
Abstract
In learning tasks with label noise, improving model robustness against overfitting is a pivotal challenge because the model eventually memorizes labels, including the noisy ones. Identifying the samples with noisy labels and preventing the model from learning them is a promising approach to address this challenge. When training with noisy labels, the per-class confidence scores of the model, represented by the class probabilities, can be reliable criteria for assessing whether the input label is the true label or the corrupted one. In this work, we exploit this observation and propose a novel discriminator metric called confidence error and a sieving strategy called CONFES to differentiate between the clean and noisy samples effectively. We provide theoretical guarantees on the probability of error for our proposed metric. Then, we experimentally illustrate the superior performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Water Systems and Optimization · Music and Audio Processing
