Automatic Pronunciation Error Detection and Correction of the Holy Quran's Learners Using Deep Learning
Abdullah Abdelfattah, Mahmoud I. Khalil, Hazem Abbas

TL;DR
This paper presents a deep learning-based system for automatic pronunciation error detection and correction in Quranic recitation, leveraging a large annotated dataset, a novel phonetic encoding, and a multi-level CTC model, with open-source resources.
Contribution
It introduces a high-quality Quranic dataset creation pipeline, a Quran-specific phonetic script, and a multi-level CTC model achieving state-of-the-art accuracy for pronunciation error detection.
Findings
Achieved 0.16% Phoneme Error Rate on test set.
Created 850+ hours of annotated Quranic audio data.
Developed a novel Quran Phonetic Script (QPS) for error detection.
Abstract
Assessing spoken language is challenging, and quantifying pronunciation metrics for machine learning models is even harder. However, for the Holy Quran, this task is simplified by the rigorous recitation rules (tajweed) established by Muslim scholars, enabling highly effective assessment. Despite this advantage, the scarcity of high-quality annotated data remains a significant barrier. In this work, we bridge these gaps by introducing: (1) A 98% automated pipeline to produce high-quality Quranic datasets -- encompassing: Collection of recitations from expert reciters, Segmentation at pause points (waqf) using our fine-tuned wav2vec2-BERT model, Transcription of segments, Transcript verification via our novel Tasmeea algorithm; (2) 850+ hours of audio (~300K annotated utterances); (3) A novel ASR-based approach for pronunciation error detection, utilizing our custom Quran Phonetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Language, Linguistics, Cultural Analysis
