Robust Loss Functions under Label Noise for Deep Neural Networks
Aritra Ghosh, Himanshu Kumar, P.S. Sastry

TL;DR
This paper investigates loss functions that are inherently robust to label noise in deep neural networks, providing theoretical conditions and demonstrating that certain loss functions like mean absolute error improve noise tolerance.
Contribution
It generalizes noise-tolerant loss function conditions from binary to multiclass classification and identifies the mean absolute error loss as inherently robust to label noise.
Findings
Mean absolute error loss is robust to label noise in deep networks.
Theoretical conditions for noise-tolerant loss functions are extended to multiclass problems.
Experiments confirm robustness of risk minimization with such loss functions.
Abstract
In many applications of classifier learning, training data suffers from label noise. Deep networks are learned using huge training data where the problem of noisy labels is particularly relevant. The current techniques proposed for learning deep networks under label noise focus on modifying the network architecture and on algorithms for estimating true labels from noisy labels. An alternate approach would be to look for loss functions that are inherently noise-tolerant. For binary classification there exist theoretical results on loss functions that are robust to label noise. In this paper, we provide some sufficient conditions on a loss function so that risk minimization under that loss function would be inherently tolerant to label noise for multiclass classification problems. These results generalize the existing results on noise-tolerant loss functions for binary classification. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Multi-Objective Optimization Algorithms
