Recognizing Ornaments in Vocal Indian Art Music with Active Annotation
Sumit Kumar, Parampreet Singh, Vipul Arora

TL;DR
This paper introduces Rga Ornamentation Detection (ROD), a new annotated dataset for recognizing vocal ornaments in Indian classical music, and develops a deep learning model that outperforms baselines in detecting these ornamentations.
Contribution
The paper presents a novel annotated dataset and a deep time-series model for detecting vocal ornamentations in Indian classical music, addressing key challenges in MIR.
Findings
The proposed model outperforms the baseline CRNN in detection accuracy.
The Rga Ornamentation Detection (ROD) dataset enables improved research in ornament recognition.
Experimental results demonstrate the effectiveness of the deep time-series analysis approach.
Abstract
Ornamentations, embellishments, or microtonal inflections are essential to melodic expression across many musical traditions, adding depth, nuance, and emotional impact to performances. Recognizing ornamentations in singing voices is key to MIR, with potential applications in music pedagogy, singer identification, genre classification, and controlled singing voice generation. However, the lack of annotated datasets and specialized modeling approaches remains a major obstacle for progress in this research area. In this work, we introduce R\=aga Ornamentation Detection (ROD), a novel dataset comprising Indian classical music recordings curated by expert musicians. The dataset is annotated using a custom Human-in-the-Loop tool for six vocal ornaments marked as event-based labels. Using this dataset, we develop an ornamentation detection model based on deep time-series analysis, preserving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Time Series Analysis and Forecasting
