Voices of the Mountains: Deep Learning-Based Vocal Error Detection System for Kurdish Maqams
Darvan Shvan Khairaldeen, Hossein Hassani

TL;DR
This paper presents a novel deep learning system for detecting pitch, rhythm, and modal errors in Kurdish maqam singing, addressing the limitations of Western-based automatic singing assessment tools.
Contribution
It introduces the first error detection system tailored for Kurdish maqam, capturing microtonal and modal errors using a CNN-BiLSTM model trained on annotated Kurdish singing data.
Findings
Model achieved macro-F1 of 0.468 on validation
Detected errors with 39.4% recall and 25.8% precision at threshold 0.750
Higher accuracy for pitch and rhythm errors compared to modal drift
Abstract
Maqam, a singing type, is a significant component of Kurdish music. A maqam singer receives training in a traditional face-to-face or through self-training. Automatic Singing Assessment (ASA) uses machine learning (ML) to provide the accuracy of singing styles and can help learners to improve their performance through error detection. Currently, the available ASA tools follow Western music rules. The musical composition requires all notes to stay within their expected pitch range from start to finish. The system fails to detect micro-intervals and pitch bends, so it identifies Kurdish maqam singing as incorrect even though the singer performs according to traditional rules. Kurdish maqam requires recognizing performance errors within microtonal spaces, which is beyond Western equal temperament. This research is the first attempt to address the mentioned gap. While many error types…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Emotion and Mood Recognition
