Deep Learning is Provably Robust to Symmetric Label Noise
Carey E. Priebe, Ningyuan Huang, Soledad Villar, Cong Mu, Li Chen

TL;DR
This paper demonstrates that certain deep neural networks can be inherently robust to high levels of symmetric label noise, achieving optimal classification without mitigation strategies, under specific theoretical conditions.
Contribution
It provides a theoretical proof that $L_1$-consistent deep neural networks can attain Bayes optimality under symmetric label noise below a certain threshold, without needing noise mitigation.
Findings
DNNs can tolerate symmetric label noise up to the information-theoretic limit.
$L_1$-consistent DNN classifiers achieve Bayes optimality under symmetric noise.
No mitigation is necessary for $L_1$-consistent estimators under symmetric label noise.
Abstract
Deep neural networks (DNNs) are capable of perfectly fitting the training data, including memorizing noisy data. It is commonly believed that memorization hurts generalization. Therefore, many recent works propose mitigation strategies to avoid noisy data or correct memorization. In this work, we step back and ask the question: Can deep learning be robust against massive label noise without any mitigation? We provide an affirmative answer for the case of symmetric label noise: We find that certain DNNs, including under-parameterized and over-parameterized models, can tolerate massive symmetric label noise up to the information-theoretic threshold. By appealing to classical statistical theory and universal consistency of DNNs, we prove that for multiclass classification, -consistent DNN classifiers trained under symmetric label noise can achieve Bayes optimality asymptotically if…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Statistical Methods and Models · Machine Learning and Algorithms
