Open Universal Arabic ASR Leaderboard

Yingzhi Wang; Anas Alhmoud; Muhammad Alqurishi

arXiv:2412.13788·cs.CL·December 19, 2024

Open Universal Arabic ASR Leaderboard

Yingzhi Wang, Anas Alhmoud, Muhammad Alqurishi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a comprehensive benchmark for open-source Arabic speech recognition models across multiple dialects, evaluating their performance, robustness, and efficiency to advance the development of universal Arabic ASR systems.

Contribution

It presents the Open Universal Arabic ASR Leaderboard, establishing a standardized evaluation framework for multi-dialect Arabic ASR models and providing detailed analysis of their capabilities.

Findings

01

Benchmark covers diverse multi-dialect datasets

02

Analysis includes robustness, speaker adaptation, efficiency, and memory use

03

Provides a reference for model performance and generalization

Abstract

In recent years, the enhanced capabilities of ASR models and the emergence of multi-dialect datasets have increasingly pushed Arabic ASR model development toward an all-dialect-in-one direction. This trend highlights the need for benchmarking studies that evaluate model performance on multiple dialects, providing the community with insights into models' generalization capabilities. In this paper, we introduce Open Universal Arabic ASR Leaderboard, a continuous benchmark project for open-source general Arabic ASR models across various multi-dialect datasets. We also provide a comprehensive analysis of the model's robustness, speaker adaptation, inference efficiency, and memory consumption. This work aims to offer the Arabic ASR community a reference for models' general performance and also establish a common evaluation framework for multi-dialectal Arabic ASR models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Natural-Language-Processing-Elm/open_universal_arabic_asr_leaderboard
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Automated Systems · Natural Language Processing Techniques