Iranian Modal Music (Dastgah) detection using deep neural networks
Danial Ebrat, Farzad Didehvar, Milad Dadgar

TL;DR
This study applies deep neural networks to recognize Iranian modal music (Dastgah) with high accuracy, highlighting the effectiveness of MFCC features and autoencoder-inspired architectures in distinguishing highly correlated musical categories.
Contribution
Introduces a deep learning approach, BiLGNet, for Iranian modal music classification, demonstrating superior accuracy and robustness over other models in non-western music genre detection.
Findings
BiLGNet achieved 92% accuracy in Dastgah detection.
MFCC features outperform other sound representations.
Bidirectional recurrent networks are most effective for this task.
Abstract
Music classification and genre detection are topics in music information retrieval (MIR) that many articles have been published regarding their utilities in the modern world. However, this contribution is insufficient in non-western music, such as Iranian modal music. In this work, we have implemented several deep neural networks to recognize Iranian modal music in seven highly correlated categories. The best model, BiLGNet, which achieved 92 percent overall accuracy, uses an architecture inspired by autoencoders, including bidirectional LSTM and GRU layers. We trained the models using the Nava dataset, which includes 1786 records and up to 55 hours of music played solo by Kamanche, Tar, Setar, Reed, and Santoor (Dulcimer). We considered Multiple features such as MFCC, Chroma CENS, and Mel spectrogram as input. The results indicate that MFCC carries more valuable information for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies · Music Education and Analysis
MethodsGated Recurrent Unit · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional GRU · Bidirectional LSTM
