NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task

Muhammad Abdul-Mageed; Chiyu Zhang; AbdelRahim Elmadany; Houda; Bouamor; Nizar Habash

arXiv:2210.09582·cs.CL·October 24, 2022

NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task

Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Houda, Bouamor, Nizar Habash

PDF

Open Access 1 Repo

TL;DR

NADI 2022 is a shared task focusing on Arabic dialect identification and sentiment analysis, providing datasets and benchmarks to advance NLP in Arabic dialects, with participation from 21 teams and challenging results.

Contribution

It introduces a standardized platform with diverse datasets for Arabic dialect tasks, fostering fair comparison and progress in dialect identification and sentiment analysis.

Findings

01

Winning team achieved 27.06 F1 in dialect identification.

02

Sentiment analysis F1 reached 75.16 by top team.

03

Both tasks remain challenging, indicating need for further research.

Abstract

We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI 2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It does so by affording diverse datasets and modeling opportunities in a standardized context where meaningful comparisons between models and approaches are possible. NADI 2022 targeted both dialect identification (Subtask 1) and dialectal sentiment analysis (Subtask 2) at the country level. A total of 41 unique teams registered for the shared task, of whom 21 teams have actually participated (with 105 valid submissions). Among these, 19 teams participated in Subtask 1 and 10 participated in Subtask 2. The winning team achieved 27.06 F1 on Subtask 1 and F1=75.16 on Subtask 2, reflecting that the two subtasks remain challenging and motivating future work in this area. We describe methods employed by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UBC-NLP/nadi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Linguistic Variation and Morphology