Making Deep Neural Networks Robust to Label Noise: a Loss Correction   Approach

Giorgio Patrini; Alessandro Rozza; Aditya Menon; Richard Nock; Lizhen; Qu

arXiv:1609.03683·stat.ML·March 23, 2017·114 cites

Making Deep Neural Networks Robust to Label Noise: a Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Menon, Richard Nock, Lizhen, Qu

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces loss correction methods to train deep neural networks robustly against class-dependent label noise, supported by theoretical analysis and extensive experiments across multiple datasets and architectures.

Contribution

It proposes two loss correction procedures that are simple, architecture-agnostic, and include an end-to-end noise estimation technique, advancing robustness in noisy label scenarios.

Findings

01

Methods improve robustness on diverse datasets

02

Loss correction is effective across various architectures

03

Loss curvature remains unaffected by label noise with ReLU

Abstract

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted into another. We further show how one can estimate these probabilities, adapting a recent technique for noise estimation to the multi-class setting, and thus providing an end-to-end framework. Extensive experiments on MNIST, IMDB, CIFAR-10, CIFAR-100 and a large scale dataset of clothing images employing a diversity of architectures --- stacking dense, convolutional, pooling, dropout, batch normalization, word embedding, LSTM and residual layers --- demonstrate the noise robustness of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach· youtube

Taxonomy

TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications

MethodsSigmoid Activation · Tanh Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Long Short-Term Memory