Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music
Nithya Shikarpur, Krishna Maneesha Dendukuri, Yusong Wu, Antoine, Caillon, Cheng-Zhi Anna Huang

TL;DR
This paper introduces GaMaDHaNi, a hierarchical generative model for Hindustani vocal melodies that captures rich melodic nuances using a finely quantized pitch contour representation, improving musical fidelity and interaction capabilities.
Contribution
It presents a novel hierarchical modeling approach using pitch contours as an intermediate representation for Hindustani singing, enhancing expressive melody generation.
Findings
Model outperforms non-hierarchical audio models in capturing pitch contours.
Hierarchical approach improves musical fidelity in generated melodies.
Potential for better human-AI musical collaboration through primed generation and pitch conditioning.
Abstract
Hindustani music is a performance-driven oral tradition that exhibits the rendition of rich melodic patterns. In this paper, we focus on generative modeling of singers' vocal melodies extracted from audio recordings, as the voice is musically prominent within the tradition. Prior generative work in Hindustani music models melodies as coarse discrete symbols which fails to capture the rich expressive melodic intricacies of singing. Thus, we propose to use a finely quantized pitch contour, as an intermediate representation for hierarchical audio modeling. We propose GaMaDHaNi, a modular two-level hierarchy, consisting of a generative model on pitch contours, and a pitch contour to audio synthesis model. We compare our approach to non-hierarchical audio models and hierarchical models that use a self-supervised intermediate representation, through a listening test and qualitative analysis.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Musicology and Musical Analysis
MethodsFocus
