Manifold DivideMix: A Semi-Supervised Contrastive Learning Framework for Severe Label Noise
Fahimeh Fooladgar, Minh Nguyen Nhat To, Parvin Mousavi, Purang, Abolmaesumi

TL;DR
This paper introduces Manifold DivideMix, a semi-supervised contrastive learning framework that effectively handles severe label noise by leveraging self-supervised training, out-of-distribution sample removal, and mixup augmentation to improve neural network robustness.
Contribution
It proposes a novel framework combining self-supervised learning, out-of-distribution sample removal, and mixup augmentation for training neural networks with noisy labels.
Findings
Effective removal of out-of-distribution noisy samples.
Improved accuracy on synthetic and real-world noisy datasets.
Robust semi-supervised training with enhanced representations.
Abstract
Deep neural networks have proven to be highly effective when large amounts of data with clean labels are available. However, their performance degrades when training data contains noisy labels, leading to poor generalization on the test set. Real-world datasets contain noisy label samples that either have similar visual semantics to other classes (in-distribution) or have no semantic relevance to any class (out-of-distribution) in the dataset. Most state-of-the-art methods leverage ID labeled noisy samples as unlabeled data for semi-supervised learning, but OOD labeled noisy samples cannot be used in this way because they do not belong to any class within the dataset. Hence, in this paper, we propose incorporating the information from all the training data by leveraging the benefits of self-supervised training. Our method aims to extract a meaningful and generalizable embedding space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Image Enhancement Techniques · Infrastructure Maintenance and Monitoring
MethodsMixup
