Optimal Condition Training for Target Source Separation

Efthymios Tzinis; Gordon Wichern; Paris Smaragdis; Jonathan Le Roux

arXiv:2211.05927·cs.SD·November 14, 2022

Optimal Condition Training for Target Source Separation

Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux

PDF

Open Access 1 Repo

TL;DR

This paper introduces an optimal condition training method for single-channel target source separation that leverages multiple semantic concepts and condition refinement to improve source disentanglement and achieve state-of-the-art results.

Contribution

The paper proposes a novel optimal condition training approach with condition refinement for improved source separation, outperforming existing models and handling diverse semantic concepts.

Findings

01

Significantly improves source separation efficiency.

02

Outperforms permutation invariant models with oracle assignment.

03

Achieves state-of-the-art in text-based source separation.

Abstract

Recent research has shown remarkable performance in leveraging multiple extraneous conditional and non-mutually exclusive semantic concepts for sound source separation, allowing the flexibility to extract a given target source based on multiple different queries. In this work, we propose a new optimal condition training (OCT) method for single-channel target source separation, based on greedy parameter updates using the highest performing condition among equivalent conditions associated with a given target source. Our experiments show that the complementary information carried by the diverse semantic concepts significantly helps to disentangle and isolate sources of interest much more efficiently compared to single-conditioned models. Moreover, we propose a variation of OCT with condition refinement, in which an initial conditional vector is adapted to the given mixture and transformed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

etzinis/optimal_condition_training
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing