Optimal Condition Training for Target Source Separation
Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux

TL;DR
This paper introduces an optimal condition training method for single-channel target source separation that leverages multiple semantic concepts and condition refinement to improve source disentanglement and achieve state-of-the-art results.
Contribution
The paper proposes a novel optimal condition training approach with condition refinement for improved source separation, outperforming existing models and handling diverse semantic concepts.
Findings
Significantly improves source separation efficiency.
Outperforms permutation invariant models with oracle assignment.
Achieves state-of-the-art in text-based source separation.
Abstract
Recent research has shown remarkable performance in leveraging multiple extraneous conditional and non-mutually exclusive semantic concepts for sound source separation, allowing the flexibility to extract a given target source based on multiple different queries. In this work, we propose a new optimal condition training (OCT) method for single-channel target source separation, based on greedy parameter updates using the highest performing condition among equivalent conditions associated with a given target source. Our experiments show that the complementary information carried by the diverse semantic concepts significantly helps to disentangle and isolate sources of interest much more efficiently compared to single-conditioned models. Moreover, we propose a variation of OCT with condition refinement, in which an initial conditional vector is adapted to the given mixture and transformed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing
