Music Source Separation with Generative Flow

Ge Zhu; Jordan Darefsky; Fei Jiang; Anton Selitskiy; Zhiyao Duan

arXiv:2204.09079 · eess.AS · November 30, 2022

TL;DR

This paper introduces a flow-based generative approach to music source separation that is trained only on individual source data. New source types can be added flexibly, and results are competitive with a fully-supervised approach.

Contribution

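The paper proposes a source-only supervised approach to music separation: flow-based generators are trained as priors on individual sources and then combined with likelihood-based objectives to separate mixtures. This removes the need for parallel mixture-source training data and makes it easier to handle mixtures with new sources.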

Findings

01 Competitive performance in singing voice separation

02 Flexible addition of new source types without retraining

03 Effective use of flow-based priors for source modeling

Abstract

Fully-supervised models for source separation are trained on parallel mixture-source data and are currently state-of-the-art. However, such parallel data is often difficult to obtain, and it is cumbersome to adapt trained models to mixtures with new sources. Source-only supervised models, in contrast, only require individual source data for training. In this paper, we first leverage flow-based generators to train individual music source priors and then use these models, along with likelihood-based objectives, to separate music mixtures. We show that in singing voice separation and music separation tasks, our proposed method is competitive with a fully-supervised approach. We also demonstrate that we can flexibly add new types of sources, whereas fully-supervised approaches would require retraining of the entire model.
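
The two-stage idea in the abstract (one prior per source, then likelihood-based separation of a mixture) can be illustrated with a deliberately tiny sketch. It is not the paper's implementation: each "flow" here is a hypothetical one-dimensional affine map standing in for a trained flow-based generator, and the objective (squared mixture residual plus negative latent log-likelihood) is a generic choice made for illustration. The names AffineFlow, separate, noise_std, lr and steps are all assumptions.

```python
import math


class AffineFlow:
    """Toy 1-D flow x = scale * z + shift with z ~ N(0, 1).

    Hypothetical stand-in for a trained flow-based source prior.
    """

    def __init__(self, scale, shift):
        self.scale = scale
        self.shift = shift

    def generate(self, z):
        # Map a latent value to a source value.
        return self.scale * z + self.shift

    def log_prob(self, x):
        # Change of variables: log p(x) = log N(z; 0, 1) - log|scale|.
        z = (x - self.shift) / self.scale
        return -0.5 * z * z - 0.5 * math.log(2 * math.pi) - math.log(abs(self.scale))


def separate(mixture, priors, noise_std=0.2, lr=0.01, steps=3000):
    """Estimate one source value per prior by gradient descent on the latents.

    Assumed loss: residual^2 / (2 * noise_std^2) + sum_i z_i^2 / 2,
    where residual = mixture - sum_i priors[i].generate(z_i).
    """
    z = [0.0 for _ in priors]
    for _ in range(steps):
        residual = mixture - sum(p.generate(zi) for p, zi in zip(priors, z))
        # Analytic gradient of the assumed loss with respect to each latent.
        grads = [zi - p.scale * residual / noise_std ** 2 for p, zi in zip(priors, z)]
        z = [zi - lr * g for zi, g in zip(z, grads)]
    return [p.generate(zi) for p, zi in zip(priors, z)]


# Each prior is defined independently of the others (source-only supervision).
vocals = AffineFlow(scale=1.0, shift=2.0)
accompaniment = AffineFlow(scale=0.5, shift=-1.0)
two_source_estimates = separate(1.8, [vocals, accompaniment])

# Adding a new source type only adds a prior; existing priors are untouched.
drums = AffineFlow(scale=0.8, shift=0.5)
three_source_estimates = separate(2.3, [vocals, accompaniment, drums])
```

In this sketch the priors never see a mixture: they are only combined at separation time, which is why a third source can be added by passing one more prior to separate. A real system would replace AffineFlow with a trained high-dimensional flow and would typically rely on automatic differentiation rather than a hand-written gradient.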

Code & Models

Repositories

gzhu06/generativesourceseparation (PyTorch, official)
