Self-Training Pre-Trained Language Models for Zero- and Few-Shot   Multi-Dialectal Arabic Sequence Labeling

Muhammad Khalifa; Muhammad Abdul-Mageed; Khaled Shaalan

arXiv:2101.04758·cs.CL·February 4, 2021

Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling

Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan

PDF

1 Repo

TL;DR

This paper presents a self-training approach for pre-trained language models to improve zero- and few-shot sequence labeling in Arabic dialects using only resources from Modern Standard Arabic, demonstrating significant performance gains.

Contribution

It introduces a self-training method that enhances dialectal Arabic sequence labeling by leveraging data-rich MSA resources in low-resource scenarios, with extensive empirical validation.

Findings

01

Self-training improves zero-shot transfer by up to ~10% F1 for NER.

02

Performance gains are confirmed through ablation studies.

03

Method extends to other languages and tasks.

Abstract

A sufficient amount of annotated data is usually required to fine-tune pre-trained language models for downstream tasks. Unfortunately, attaining labeled data can be costly, especially for multiple language varieties and dialects. We propose to self-train pre-trained language models in zero- and few-shot scenarios to improve performance on data-scarce varieties using only resources from data-rich ones. We demonstrate the utility of our approach in the context of Arabic sequence labeling by using a language model fine-tuned on Modern Standard Arabic (MSA) only to predict named entities (NE) and part-of-speech (POS) tags on several dialectal Arabic (DA) varieties. We show that self-training is indeed powerful, improving zero-shot MSA-to-DA transfer by as large as \texttildelow 10\% F $_{1}$ (NER) and 2\% accuracy (POS tagging). We acquire even better performance in few-shot scenarios with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mohammadKhalifa/zero-shot-arabic-dialects
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.