MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation

Chang-Bin Jeon; Hyeongi Moon; Keunwoo Choi; Ben Sangbae Chon; and; Kyogu Lee

arXiv:2211.07302·cs.SD·May 5, 2023

MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation

Chang-Bin Jeon, Hyeongi Moon, Keunwoo Choi, Ben Sangbae Chon, and, Kyogu Lee

PDF

Open Access 1 Repo 1 Models

TL;DR

MedleyVox introduces a new dataset and baseline methods for separating multiple singing voices, addressing a key gap in music source separation research and enabling future advancements in the field.

Contribution

The paper presents MedleyVox, a novel dataset for multiple singing voices separation, along with an improved super-resolution network and a strategy for constructing multi-singing mixtures.

Findings

01

iSRNet enhances separation quality significantly.

02

Joint training with Conv-TasNet yields competitive results.

03

MedleyVox enables standardized evaluation of multi-singing separation.

Abstract

Separation of multiple singing voices into each voice is a rarely studied area in music source separation research. The absence of a benchmark dataset has hindered its progress. In this paper, we present an evaluation dataset and provide baseline studies for multiple singing voices separation. First, we introduce MedleyVox, an evaluation dataset for multiple singing voices separation. We specify the problem definition in this dataset by categorizing it into i) unison, ii) duet, iii) main vs. rest, and iv) N-singing separation. Second, to overcome the absence of existing multi-singing datasets for a training purpose, we present a strategy for construction of multiple singing mixtures using various single-singing datasets. Third, we propose the improved super-resolution network (iSRNet), which greatly enhances initial estimates of separation networks. Jointly trained with the Conv-TasNet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeonchangbin49/medleyvox
pytorchOfficial

Models

🤗
Cyru5/MedleyVox
model· ♡ 11
♡ 11

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing