Quran Recitation Recognition using End-to-End Deep Learning
Ahmad Al Harere, Khloud Al Jallad

TL;DR
This paper introduces an end-to-end deep learning model for automatic Quran recitation recognition, achieving promising accuracy on a new public dataset, and aims to establish a baseline for future research in this domain.
Contribution
The paper presents a novel CNN-Bidirectional GRU model with CTC and beam search decoding for Quran recitation recognition, utilizing a large public dataset for evaluation.
Findings
Achieved 8.34% WER and 2.42% CER on the Ar-DAD dataset.
Proposed model outperforms previous approaches on this dataset.
Provides a baseline for future Quran recitation recognition research.
Abstract
The Quran is the holy scripture of Islam, and its recitation is an important aspect of the religion. Recognizing the recitation of the Holy Quran automatically is a challenging task due to its unique rules that are not applied in normal speaking speeches. A lot of research has been done in this domain, but previous works have detected recitation errors as a classification task or used traditional automatic speech recognition (ASR). In this paper, we proposed a novel end-to-end deep learning model for recognizing the recitation of the Holy Quran. The proposed model is a CNN-Bidirectional GRU encoder that uses CTC as an objective function, and a character-based decoder which is a beam search decoder. Moreover, all previous works were done on small private datasets consisting of short verses and a few chapters of the Holy Quran. As a result of using private datasets, no comparisons were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsGated Recurrent Unit
