ASMDD: Arabic Speech Mispronunciation Detection Dataset

Salah A. Aly; Abdelrahman Salah; Hesham M. Eraqi

arXiv:2111.01136·cs.CL·November 3, 2021

ASMDD: Arabic Speech Mispronunciation Detection Dataset

Salah A. Aly, Abdelrahman Salah, Hesham M. Eraqi

PDF

Open Access

TL;DR

This paper introduces ASMDD, the largest annotated dataset of Egyptian children's Arabic speech for mispronunciation detection, focusing on the top 100 frequently used words.

Contribution

It provides a comprehensive, expert-annotated dataset specifically designed for Arabic speech mispronunciation detection in Egyptian children.

Findings

01

Largest dataset of its kind for Arabic speech mispronunciation detection

02

Includes detailed annotations by expert listeners

03

Focuses on commonly used words in Egyptian Arabic

Abstract

The largest dataset of Arabic speech mispronunciation detections in Egyptian dialogues is introduced. The dataset is composed of annotated audio files representing the top 100 words that are most frequently used in the Arabic language, pronounced by 100 Egyptian children (aged between 2 and 8 years old). The dataset is collected and annotated on segmental pronunciation error detections by expert listeners.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Speech and dialogue systems