Quran Recitation Recognition using End-to-End Deep Learning

Ahmad Al Harere; Khloud Al Jallad

arXiv:2305.07034·eess.AS·May 15, 2023·6 cites

Quran Recitation Recognition using End-to-End Deep Learning

Ahmad Al Harere, Khloud Al Jallad

PDF

Open Access

TL;DR

This paper introduces an end-to-end deep learning model for automatic Quran recitation recognition, achieving promising accuracy on a new public dataset, and aims to establish a baseline for future research in this domain.

Contribution

The paper presents a novel CNN-Bidirectional GRU model with CTC and beam search decoding for Quran recitation recognition, utilizing a large public dataset for evaluation.

Findings

01

Achieved 8.34% WER and 2.42% CER on the Ar-DAD dataset.

02

Proposed model outperforms previous approaches on this dataset.

03

Provides a baseline for future Quran recitation recognition research.

Abstract

The Quran is the holy scripture of Islam, and its recitation is an important aspect of the religion. Recognizing the recitation of the Holy Quran automatically is a challenging task due to its unique rules that are not applied in normal speaking speeches. A lot of research has been done in this domain, but previous works have detected recitation errors as a classification task or used traditional automatic speech recognition (ASR). In this paper, we proposed a novel end-to-end deep learning model for recognizing the recitation of the Holy Quran. The proposed model is a CNN-Bidirectional GRU encoder that uses CTC as an objective function, and a character-based decoder which is a beam search decoder. Moreover, all previous works were done on small private datasets consisting of short verses and a few chapters of the Holy Quran. As a result of using private datasets, no comparisons were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsGated Recurrent Unit