Label Noise-Robust Learning using a Confidence-Based Sieving Strategy

Reihaneh Torkzadehmahani; Reza Nasirigerdeh; Daniel Rueckert; Georgios; Kaissis

arXiv:2210.05330·cs.LG·September 28, 2023

Label Noise-Robust Learning using a Confidence-Based Sieving Strategy

Reihaneh Torkzadehmahani, Reza Nasirigerdeh, Daniel Rueckert, Georgios, Kaissis

PDF

Open Access

TL;DR

This paper introduces CONFES, a confidence-based sieving strategy that effectively distinguishes clean from noisy labels, enhancing model robustness in noisy label learning scenarios, with theoretical guarantees and empirical validation.

Contribution

The paper proposes a novel confidence error metric and CONFES strategy for noise-robust learning, with theoretical error bounds and compatibility with existing methods.

Findings

01

CONFES outperforms recent methods on synthetic and real-world noise datasets.

02

The confidence error metric provides reliable noise detection with theoretical guarantees.

03

Combining CONFES with other approaches further improves robustness.

Abstract

In learning tasks with label noise, improving model robustness against overfitting is a pivotal challenge because the model eventually memorizes labels, including the noisy ones. Identifying the samples with noisy labels and preventing the model from learning them is a promising approach to address this challenge. When training with noisy labels, the per-class confidence scores of the model, represented by the class probabilities, can be reliable criteria for assessing whether the input label is the true label or the corrupted one. In this work, we exploit this observation and propose a novel discriminator metric called confidence error and a sieving strategy called CONFES to differentiate between the clean and noisy samples effectively. We provide theoretical guarantees on the probability of error for our proposed metric. Then, we experimentally illustrate the superior performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Water Systems and Optimization · Music and Audio Processing