Label Noise Types and Their Effects on Deep Learning
G\"orkem Algan, \.Ilkay Ulusoy

TL;DR
This paper analyzes how different types of label noise affect deep learning and introduces a framework to generate challenging feature-dependent noisy labels, aiding fair comparison of noise-robust algorithms.
Contribution
It provides a detailed analysis of label noise effects and proposes a generic method to generate feature-dependent label noise for benchmarking.
Findings
Different label noise types impact learning distinctly.
The proposed feature-dependent noise generation is highly challenging for models.
Shared noisy datasets facilitate fair evaluation of noise-robust methods.
Abstract
The recent success of deep learning is mostly due to the availability of big datasets with clean annotations. However, gathering a cleanly annotated dataset is not always feasible due to practical challenges. As a result, label noise is a common problem in datasets, and numerous methods to train deep neural networks in the presence of noisy labels are proposed in the literature. These methods commonly use benchmark datasets with synthetic label noise on the training set. However, there are multiple types of label noise, and each of them has its own characteristic impact on learning. Since each work generates a different kind of label noise, it is problematic to test and compare those algorithms in the literature fairly. In this work, we provide a detailed analysis of the effects of different kinds of label noise on learning. Moreover, we propose a generic framework to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Neural Network Applications
MethodsSoftmax
