Voices of the Mountains: Deep Learning-Based Vocal Error Detection System for Kurdish Maqams

Darvan Shvan Khairaldeen; Hossein Hassani

arXiv:2602.20744·cs.SD·February 25, 2026

Voices of the Mountains: Deep Learning-Based Vocal Error Detection System for Kurdish Maqams

Darvan Shvan Khairaldeen, Hossein Hassani

PDF

Open Access

TL;DR

This paper presents a novel deep learning system for detecting pitch, rhythm, and modal errors in Kurdish maqam singing, addressing the limitations of Western-based automatic singing assessment tools.

Contribution

It introduces the first error detection system tailored for Kurdish maqam, capturing microtonal and modal errors using a CNN-BiLSTM model trained on annotated Kurdish singing data.

Findings

01

Model achieved macro-F1 of 0.468 on validation

02

Detected errors with 39.4% recall and 25.8% precision at threshold 0.750

03

Higher accuracy for pitch and rhythm errors compared to modal drift

Abstract

Maqam, a singing type, is a significant component of Kurdish music. A maqam singer receives training in a traditional face-to-face or through self-training. Automatic Singing Assessment (ASA) uses machine learning (ML) to provide the accuracy of singing styles and can help learners to improve their performance through error detection. Currently, the available ASA tools follow Western music rules. The musical composition requires all notes to stay within their expected pitch range from start to finish. The system fails to detect micro-intervals and pitch bends, so it identifies Kurdish maqam singing as incorrect even though the singer performs according to traditional rules. Kurdish maqam requires recognizing performance errors within microtonal spaces, which is beyond Western equal temperament. This research is the first attempt to address the mentioned gap. While many error types…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Emotion and Mood Recognition