Handling Realistic Label Noise in BERT Text Classification
Maha Tufail Agro, Hanan Aldarmaki

TL;DR
This paper investigates how realistic, feature-dependent label noise affects BERT text classification and evaluates ensemble and noise-cleaning methods to enhance robustness against such noise.
Contribution
It introduces an analysis of BERT's vulnerability to realistic label noise and compares various ensemble and cleaning techniques for improved robustness.
Findings
Realistic label noise significantly degrades BERT performance.
Ensemble methods can mitigate the impact of label noise.
Noise-cleaning techniques improve classification accuracy under noisy conditions.
Abstract
Labels noise refers to errors in training labels caused by cheap data annotation methods, such as web scraping or crowd-sourcing, which can be detrimental to the performance of supervised classifiers. Several methods have been proposed to counteract the effect of random label noise in supervised classification, and some studies have shown that BERT is already robust against high rates of randomly injected label noise. However, real label noise is not random; rather, it is often correlated with input features or other annotator-specific factors. In this paper, we evaluate BERT in the presence of two types of realistic label noise: feature-dependent label noise, and synthetic label noise from annotator disagreements. We show that the presence of these types of noise significantly degrades BERT classification performance. To improve robustness, we evaluate different types of ensembles and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Infrastructure Maintenance and Monitoring
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Dense Connections · WordPiece · Weight Decay · Linear Warmup With Linear Decay · Attention Dropout
