Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A   Study on Text Classification for African Languages

Dawei Zhu; Michael A. Hedderich; Fangzhou Zhai; David Ifeoluwa; Adelani; Dietrich Klakow

arXiv:2206.01476·cs.CL·June 6, 2022

Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages

Dawei Zhu, Michael A. Hedderich, Fangzhou Zhai, David Ifeoluwa, Adelani, Dietrich Klakow

PDF

Open Access

TL;DR

This paper investigates the effectiveness of task-adaptive pre-training techniques in improving text classification performance in low-resource African languages with noisy labels, comparing various noise-handling methods.

Contribution

It demonstrates that task-adaptive pre-training significantly enhances learning with noisy labels in low-resource language text classification tasks.

Findings

01

Task-adaptive pre-training improves accuracy with noisy labels.

02

Noise-handling methods combined with pre-training yield better results.

03

Realistic noise scenarios are effectively mitigated by the proposed approach.

Abstract

For high-resource languages like English, text classification is a well-studied task. The performance of modern NLP models easily achieves an accuracy of more than 90% in many standard datasets for text classification in English (Xie et al., 2019; Yang et al., 2019; Zaheer et al., 2020). However, text classification in low-resource languages is still challenging due to the lack of annotated data. Although methods like weak supervision and crowdsourcing can help ease the annotation bottleneck, the annotations obtained by these methods contain label noise. Models trained with label noise may not generalize well. To this end, a variety of noise-handling techniques have been proposed to alleviate the negative impact caused by the errors in the annotations (for extensive surveys see (Hedderich et al., 2021; Algan & Ulusoy, 2021)). In this work, we experiment with a group of standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Machine Learning and Algorithms