Explainable Deep Learning Analysis for Raga Identification in Indian Art Music
Parampreet Singh, Vipul Arora

TL;DR
This paper develops an explainable deep learning approach for Indian Raga identification, creating a new dataset, training a CNN-LSTM model, and applying explainability techniques to ensure the model's reasoning aligns with human expert understanding.
Contribution
It introduces a curated dataset of Hindustani Classical Music, trains a CNN-LSTM model for Raga identification, and employs explainability methods to interpret model decisions in line with human expertise.
Findings
Achieved 0.89 F1-score on Raga classification
Model explanations align well with human expert annotations
Demonstrated the effectiveness of explainability techniques in music classification
Abstract
Raga identification is an important problem within the domain of Indian Art music, as Ragas are fundamental to its composition and performance, playing a crucial role in music retrieval, preservation, and education. Few studies that have explored this task employ approaches such as signal processing, Machine Learning (ML), and more recently, Deep Learning (DL) based methods. However, a key question remains unanswered in all these works: do these ML/DL methods learn and interpret Ragas in a manner similar to human experts? Besides, a significant roadblock in this research is the unavailability of an ample supply of rich, labeled datasets, which drives these ML/DL-based methods. In this paper, firstly we curate a dataset comprising 191 hours of Hindustani Classical Music (HCM) recordings, annotate it for Raga and tonic labels, and train a CNN-LSTM model for the task of Automatic Raga…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
MethodsALIGN
