LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment

Filipe R. Cordeiro; Ragav Sachdeva; Vasileios Belagiannis; Ian Reid; Gustavo Carneiro

arXiv:2103.04173·cs.CV·September 7, 2022

TL;DR

LongReMix is a two-stage training algorithm that makes deep neural networks more robust to high rates of label noise by separating clean from noisy training samples more precisely, reaching state-of-the-art results on most of the benchmarks tested.

Contribution

The paper introduces LongReMix, a two-stage noisy-label learning algorithm that improves generalization in high-noise scenarios.

Findings

01 · LongReMix outperforms existing methods on multiple noisy-label benchmarks.

02 · It achieves state-of-the-art results on most of the tested datasets.

03 · The approach is particularly effective under high label noise.

Abstract

Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue. The most competitive noisy-label learning algorithms rely on a 2-stage process comprising an unsupervised learning to classify training samples as clean or noisy, followed by a semi-supervised learning that minimises the empirical vicinal risk (EVR) using a labelled set formed by samples classified as clean, and an unlabelled set with samples classified as noisy. In this paper, we hypothesise that the generalisation of such 2-stage noisy-label learning methods depends on the precision of the unsupervised classifier and the size of the training set to minimise the EVR. We empirically validate these two hypotheses and propose the new 2-stage noisy-label training algorithm LongReMix. We test LongReMix on the noisy-label…
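
The two-stage process described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the unsupervised stage fits a two-component Gaussian mixture to per-sample training losses and treats the low-loss component as clean, and it uses MixUp-style interpolation as one way to draw samples from a vicinal distribution. The function names `split_clean_noisy` and `mixup`, and all parameter values, are hypothetical.

```python
import numpy as np


def split_clean_noisy(losses, n_iter=50, threshold=0.5):
    """Stage 1 (assumed form): fit a 1-D two-component Gaussian mixture
    to per-sample losses with EM. Returns a boolean mask that is True
    for samples assigned to the low-loss (presumed clean) component."""
    losses = np.asarray(losses, dtype=float)
    x = losses[:, None]
    # Start the two components at the extremes of the loss range.
    mu = np.array([losses.min(), losses.max()])
    var = np.full(2, losses.var() + 1e-6)
    pi = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior responsibility of each component per sample.
        log_p = (-0.5 * (x - mu) ** 2 / var
                 - 0.5 * np.log(2.0 * np.pi * var)
                 + np.log(pi))
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means, and variances.
        nk = resp.sum(axis=0) + 1e-12
        mu = (resp * x).sum(axis=0) / nk
        var = (resp * (x - mu) ** 2).sum(axis=0) / nk + 1e-6
        pi = nk / nk.sum()
    clean_component = int(np.argmin(mu))
    return resp[:, clean_component] > threshold


def mixup(x1, y1, x2, y2, alpha=4.0, rng=None):
    """Stage 2 building block (assumed form): convex combination of two
    inputs and their soft labels, with the weight drawn from Beta(alpha, alpha)."""
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)
    return lam * x1 + (1.0 - lam) * x2, lam * y1 + (1.0 - lam) * y2
```

In this sketch, samples where the mask is True would form the labelled set and the remainder the unlabelled set for the semi-supervised stage. The precision of that mask and the number of samples it retains are the two quantities the abstract's hypotheses concern.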

Code & Models

Repositories

filipe-research/LongReMix
PyTorch · Official

Taxonomy

Topics: Machine Learning and Data Classification · Water Systems and Optimization · Text and Document Classification Technologies