Deep domain adaptation for polyphonic melody extraction
Kavya Ranjan Saxena, Vipul Arora

TL;DR
This paper explores domain adaptation techniques for polyphonic melody extraction, demonstrating that meta-learning-based adaptation outperforms traditional fine-tuning and existing algorithms, improving model performance across different music domains.
Contribution
The study introduces and evaluates meta-learning-based adaptation methods for polyphonic melody extraction, showing they outperform fine-tuning and current state-of-the-art algorithms.
Findings
Meta-learning adaptation outperforms fine-tuning.
Meta-learning surpasses existing non-adaptive algorithms.
Adaptive methods improve cross-domain melody extraction performance.
Abstract
Extraction of the predominant pitch from polyphonic audio is one of the fundamental tasks in the field of music information retrieval and computational musicology. To accomplish this task using machine learning, a large amount of labeled audio data is required to train the model that predicts the pitch contour. But a classical model pre-trained on data from one domain (source), e.g, songs of a particular singer or genre, may not perform comparatively well in extracting melody from other domains (target). The performance of such models can be boosted by adapting the model using some annotated data in the target domain. In this work, we study various adaptation techniques applied to machine learning models for polyphonic melody extraction. Experimental results show that meta-learning-based adaptation performs better than simple fine-tuning. In addition to this, we find that this method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
