NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
Tingting Wu, Xiao Ding, Minji Tang, Hao Zhang, Bing Qin, Ting Liu

TL;DR
This paper introduces NoisywikiHow, a large NLP benchmark with realistic, heterogeneous label noise inspired by human errors, enabling systematic evaluation of learning with noisy labels in real-world scenarios.
Contribution
It presents the first large-scale NLP benchmark with instance-dependent noise mimicking real-world errors, supporting controlled experiments for better LNL method evaluation.
Findings
LNL methods show varied performance across noise levels.
Realistic noise impacts model generalization more than synthetic noise.
Benchmark enables comprehensive evaluation of noise-robust learning techniques.
Abstract
Large-scale datasets in the real world inevitably involve label noise. Deep models can gradually overfit noisy labels and thus degrade model generalization. To mitigate the effects of label noise, learning with noisy labels (LNL) methods are designed to achieve better generalization performance. Due to the lack of suitable datasets, previous studies have frequently employed synthetic label noise to mimic real-world label noise. However, synthetic noise is not instance-dependent, making this approximation not always effective in practice. Recent research has proposed benchmarks for learning with real-world noisy labels. However, the noise sources within may be single or fuzzy, making benchmarks different from data with heterogeneous label noises in the real world. To tackle these issues, we contribute NoisywikiHow, the largest NLP benchmark built with minimal supervision. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Water Systems and Optimization · Anomaly Detection Techniques and Applications
