NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task
Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Houda, Bouamor, Nizar Habash

TL;DR
NADI 2022 is a shared task focusing on Arabic dialect identification and sentiment analysis, providing datasets and benchmarks to advance NLP in Arabic dialects, with participation from 21 teams and challenging results.
Contribution
It introduces a standardized platform with diverse datasets for Arabic dialect tasks, fostering fair comparison and progress in dialect identification and sentiment analysis.
Findings
Winning team achieved 27.06 F1 in dialect identification.
Sentiment analysis F1 reached 75.16 by top team.
Both tasks remain challenging, indicating need for further research.
Abstract
We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI 2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It does so by affording diverse datasets and modeling opportunities in a standardized context where meaningful comparisons between models and approaches are possible. NADI 2022 targeted both dialect identification (Subtask 1) and dialectal sentiment analysis (Subtask 2) at the country level. A total of 41 unique teams registered for the shared task, of whom 21 teams have actually participated (with 105 valid submissions). Among these, 19 teams participated in Subtask 1 and 10 participated in Subtask 2. The winning team achieved 27.06 F1 on Subtask 1 and F1=75.16 on Subtask 2, reflecting that the two subtasks remain challenging and motivating future work in this area. We describe methods employed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Linguistic Variation and Morphology
