Robust Testing for Deep Learning using Human Label Noise
Gordon Lim, Stefan Larson, Kevin Leach

TL;DR
This paper introduces a new method for generating realistic human-like label noise in deep learning datasets, enabling more accurate evaluation of models' robustness to real-world labeling errors.
Contribution
We propose Cluster-Based Noise (CBN), a feature-dependent noise generation method that simulates human label noise, and Soft Neighbor Label Sampling (SNLS), a technique to improve model training under this noise.
Findings
CBN creates more realistic noisy labels than synthetic methods.
Current LNL methods perform worse under CBN, indicating increased challenge.
SNLS improves model robustness against human-like label noise.
Abstract
In deep learning (DL) systems, label noise in training datasets often degrades model performance, as models may learn incorrect patterns from mislabeled data. The area of Learning with Noisy Labels (LNL) has introduced methods to effectively train DL models in the presence of noisily-labeled datasets. Traditionally, these methods are tested using synthetic label noise, where ground truth labels are randomly (and automatically) flipped. However, recent findings highlight that models perform substantially worse under human label noise than synthetic label noise, indicating a need for more realistic test scenarios that reflect noise introduced due to imperfect human labeling. This underscores the need for generating realistic noisy labels that simulate human label noise, enabling rigorous testing of deep neural networks without the need to collect new human-labeled datasets. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Chemical Sensor Technologies · Advanced Statistical Process Monitoring
