Towards a New Understanding of the Training of Neural Networks with   Mislabeled Training Data

Herbert Gish; Jan Silovsky; Man-Ling Sung; Man-Hung Siu; William; Hartmann; Zhuolin Jiang

arXiv:1909.09136·cs.LG·September 23, 2019

Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data

Herbert Gish, Jan Silovsky, Man-Ling Sung, Man-Hung Siu, William, Hartmann, Zhuolin Jiang

PDF

Open Access

TL;DR

This paper analyzes the impact of mislabeled data on neural network training, providing theoretical insights and proposing a practical approach to improve classifier robustness by adjusting decision thresholds based on noise levels.

Contribution

It offers a theoretical analysis of noisy models, showing ML estimates relate to clean models, and introduces a method to train classifiers on noisy data with threshold adjustments.

Findings

01

ML estimates of noisy models determine those of clean models

02

Adjusting decision thresholds improves classification with mislabeled data

03

The approach works with multi-layer perceptrons (MLPs)

Abstract

We investigate the problem of machine learning with mislabeled training data. We try to make the effects of mislabeled training better understood through analysis of the basic model and equations that characterize the problem. This includes results about the ability of the noisy model to make the same decisions as the clean model and the effects of noise on model performance. In addition to providing better insights we also are able to show that the Maximum Likelihood (ML) estimate of the parameters of the noisy model determine those of the clean model. This property is obtained through the use of the ML invariance property and leads to an approach to developing a classifier when training has been mislabeled: namely train the classifier on noisy data and adjust the decision threshold based on the noise levels and/or class priors. We show how our approach to mislabeled training works…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Industrial Vision Systems and Defect Detection