Weakly Supervised Audio Source Separation via Spectrum Energy Preserved   Wasserstein Learning

Ning Zhang; Junchi Yan; Yuchen Zhou

arXiv:1711.04121·cs.SD·May 18, 2018·2 cites

Weakly Supervised Audio Source Separation via Spectrum Energy Preserved Wasserstein Learning

Ning Zhang, Junchi Yan, Yuchen Zhou

PDF

Open Access

TL;DR

This paper presents a novel weakly supervised deep learning method for audio source separation that uses Wasserstein distance and spectrum energy preservation, achieving competitive results without extensive prior assumptions.

Contribution

Introduces a spectrum energy preserved Wasserstein learning framework for weakly supervised audio source separation, reducing the need for prior model constraints.

Findings

01

Performs competitively on benchmark datasets.

02

Requires minimal prior model assumptions.

03

End-to-end training capability.

Abstract

Separating audio mixtures into individual instrument tracks has been a long standing challenging task. We introduce a novel weakly supervised audio source separation approach based on deep adversarial learning. Specifically, our loss function adopts the Wasserstein distance which directly measures the distribution distance between the separated sources and the real sources for each individual source. Moreover, a global regularization term is added to fulfill the spectrum energy preservation property regardless separation. Unlike state-of-the-art weakly supervised models which often involve deliberately devised constraints or careful model selection, our approach need little prior model specification on the data, and can be straightforwardly learned in an end-to-end fashion. We show that the proposed method performs competitively on public benchmark against state-of-the-art weakly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Blind Source Separation Techniques