ArabicDialectHub: A Cross-Dialectal Arabic Learning Resource and Platform
Salem Lahlou

TL;DR
ArabicDialectHub is a comprehensive, open-source platform and dataset that facilitates cross-dialectal Arabic learning through interactive features, validated phrases, and cultural context, supporting learners across six Arabic varieties.
Contribution
It introduces a novel cross-dialectal Arabic resource with an interactive platform, generated and validated using LLMs and native speakers, enhancing language learning tools.
Findings
Generated 552 validated phrases across six Arabic dialects.
Developed an interactive platform with adaptive quizzing and cultural context.
Released open-source dataset and platform under MIT license.
Abstract
We present ArabicDialectHub, a cross-dialectal Arabic learning resource comprising 552 phrases across six varieties (Moroccan Darija, Lebanese, Syrian, Emirati, Saudi, and MSA) and an interactive web platform. Phrases were generated using LLMs and validated by five native speakers, stratified by difficulty, and organized thematically. The open-source platform provides translation exploration, adaptive quizzing with algorithmic distractor generation, cloud-synchronized progress tracking, and cultural context. Both the dataset and complete platform source code are released under MIT license. Platform: https://arabic-dialect-hub.netlify.app.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Language and cultural evolution · Linguistic Variation and Morphology
