Making Deep Neural Networks Robust to Label Noise: a Loss Correction Approach
Giorgio Patrini, Alessandro Rozza, Aditya Menon, Richard Nock, Lizhen, Qu

TL;DR
This paper introduces loss correction methods to train deep neural networks robustly against class-dependent label noise, supported by theoretical analysis and extensive experiments across multiple datasets and architectures.
Contribution
It proposes two loss correction procedures that are simple, architecture-agnostic, and include an end-to-end noise estimation technique, advancing robustness in noisy label scenarios.
Findings
Methods improve robustness on diverse datasets
Loss correction is effective across various architectures
Loss curvature remains unaffected by label noise with ReLU
Abstract
We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted into another. We further show how one can estimate these probabilities, adapting a recent technique for noise estimation to the multi-class setting, and thus providing an end-to-end framework. Extensive experiments on MNIST, IMDB, CIFAR-10, CIFAR-100 and a large scale dataset of clothing images employing a diversity of architectures --- stacking dense, convolutional, pooling, dropout, batch normalization, word embedding, LSTM and residual layers --- demonstrate the noise robustness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach· youtube
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsSigmoid Activation · Tanh Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Long Short-Term Memory
