An Effective Label Noise Model for DNN Text Classification
Ishan Jindal, Daniel Pressel, Brian Lester, Matthew Nokleby

TL;DR
This paper presents a novel noise-robust training method for deep neural networks in text classification, introducing a noise model layer that jointly learns with the classifier to handle label errors effectively.
Contribution
It proposes a non-linear noise model layer integrated into CNNs that jointly learns noise statistics and improves robustness to label noise in text classification.
Findings
The approach improves sentence representations under label noise.
Proper initialization and regularization of the noise model are crucial.
Changing batch size has minimal impact on performance.
Abstract
Because large, human-annotated datasets suffer from labeling errors, it is crucial to be able to train deep neural networks in the presence of label noise. While training image classification models with label noise have received much attention, training text classification models have not. In this paper, we propose an approach to training deep networks that is robust to label noise. This approach introduces a non-linear processing layer (noise model) that models the statistics of the label noise into a convolutional neural network (CNN) architecture. The noise model and the CNN weights are learned jointly from noisy training data, which prevents the model from overfitting to erroneous labels. Through extensive experiments on several text classification datasets, we show that this approach enables the CNN to learn better sentence representations and is robust even to extreme label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques
