LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment
Filipe R. Cordeiro, Ragav Sachdeva, Vasileios Belagiannis, Ian Reid,, Gustavo Carneiro

TL;DR
LongReMix is a new two-stage training algorithm that improves deep neural network robustness in high noise label environments by better classifying clean and noisy samples, leading to state-of-the-art results.
Contribution
It introduces LongReMix, a novel two-stage noisy-label learning algorithm that enhances generalization and performance in high noise scenarios.
Findings
LongReMix outperforms existing methods on multiple noisy-label benchmarks.
It achieves state-of-the-art results in most tested datasets.
The approach is particularly effective in high label noise conditions.
Abstract
Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue. The most competitive noisy-label learning algorithms rely on a 2-stage process comprising an unsupervised learning to classify training samples as clean or noisy, followed by a semi-supervised learning that minimises the empirical vicinal risk (EVR) using a labelled set formed by samples classified as clean, and an unlabelled set with samples classified as noisy. In this paper, we hypothesise that the generalisation of such 2-stage noisy-label learning methods depends on the precision of the unsupervised classifier and the size of the training set to minimise the EVR. We empirically validate these two hypotheses and propose the new 2-stage noisy-label training algorithm LongReMix. We test LongReMix on the noisy-label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Water Systems and Optimization · Text and Document Classification Technologies
