Dialectal Coverage And Generalization in Arabic Speech Recognition

Amirbek Djanibekov; Hawau Olamide Toyin; Raghad Alshalan; Abdullah Alitr; Hanan Aldarmaki

arXiv:2411.05872·cs.CL·June 2, 2025

TL;DR

This paper presents a suite of open-source Arabic speech recognition models that recognize Modern Standard Arabic (MSA), multiple dialects, and code-switched speech, improving coverage and performance across spoken variants.

Contribution

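The paper introduces open-source pre-trained models covering data from 17 Arabic-speaking countries, fine-tuned MSA and dialectal ASR models that include at least 11 variants, and multilingual ASR models that handle code-switching, and it reports performance improvements.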

Findings

01 Enhanced recognition accuracy across Arabic dialects

02 Effective handling of code-switching scenarios

03 Open-source models covering diverse Arabic varieties

Abstract

Developing robust automatic speech recognition (ASR) systems for Arabic requires effective strategies to manage its diversity. Existing ASR systems mainly cover the Modern Standard Arabic (MSA) variety and a few high-resource dialects, but fall short in coverage and generalization across the multitude of spoken variants. Code-switching with English and French is also common in different regions of the Arab world, which challenges the performance of monolingual Arabic models. In this work, we introduce a suite of ASR models optimized to effectively recognize multiple variants of spoken Arabic, including MSA, various dialects, and code-switching. We provide open-source pre-trained models that cover data from 17 Arabic-speaking countries, and fine-tuned MSA and dialectal ASR models that include at least 11 variants, as well as multilingual ASR models covering embedded languages in…

Code & Models

Repositories

mbzuai-nlp/artst (PyTorch, official)

Taxonomy

Topics: Speech Recognition and Synthesis