Similarities between Arabic Dialects: Investigating Geographical Proximity
Abdulkareem Alsudais, Wafa Alotaibi, Faye Alomary

TL;DR
This study investigates how geographical proximity influences dialectical similarities among Arabic-speaking cities, revealing that nearby cities often share dialect features regardless of national borders, which impacts dialect classification strategies.
Contribution
It introduces a nuanced analysis of dialect similarities based on geographic proximity, challenging traditional country-based dialect classification methods.
Findings
Cities closer geographically tend to have more similar dialects.
Dialect similarity can cross national borders based on proximity.
Geographical proximity is a significant factor in dialect classification.
Abstract
The automatic classification of Arabic dialects is an ongoing research challenge, which has been explored in recent work that defines dialects based on increasingly limited geographic areas like cities and provinces. This paper focuses on a related yet relatively unexplored topic: the effects of the geographical proximity of cities located in Arab countries on their dialectical similarity. Our work is twofold, reliant on: 1) comparing the textual similarities between dialects using cosine similarity and 2) measuring the geographical distance between locations. We study MADAR and NADI, two established datasets with Arabic dialects from many cities and provinces. Our results indicate that cities located in different countries may in fact have more dialectical similarity than cities within the same country, depending on their geographical proximity. The correlation between dialectical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
