Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling
Michael Heck, Christian Geishauser, Nurul Lubis, Carel van Niekerk,, Shutong Feng, Hsien-Chin Lin, Benjamin Matthias Ruppik, Renato Vukovic,, Milica Ga\v{s}i\'c

TL;DR
This paper introduces an unsupervised, real-time meta learning approach for reweighting training samples to mitigate the impact of noisy labels without requiring clean seed data, improving NLP task performance.
Contribution
It proposes a novel on-the-fly meta loss rescaling method that learns to reweight samples using only model features, applicable across various NLP tasks including dialogue modeling.
Findings
Consistently improves NLP task performance with minimal overhead
Effective on noisy and ambiguous labels, including dialogue modeling
Robust against class imbalance and overfitting to noisy data
Abstract
Correct labels are indispensable for training effective machine learning models. However, creating high-quality labels is expensive, and even professionally labeled data contains errors and ambiguities. Filtering and denoising can be applied to curate labeled data prior to training, at the cost of additional processing and loss of information. An alternative is on-the-fly sample reweighting during the training process to decrease the negative impact of incorrect or ambiguous labels, but this typically requires clean seed data. In this work we propose unsupervised on-the-fly meta loss rescaling to reweight training samples. Crucially, we rely only on features provided by the model being trained, to learn a rescaling function in real time without knowledge of the true clean data distribution. We achieve this via a novel meta learning setup that samples validation data for the meta update…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Educational Technology and Assessment
