Improving Source Separation by Explicitly Modeling Dependencies Between Sources
Ethan Manilow, Curtis Hawthorne, Cheng-Zhi Anna Huang, Bryan Pardo,, Jesse Engel

TL;DR
This paper introduces a novel source separation method that models interdependencies between sources using an Orderless NADE framework, improving separation quality through iterative Gibbs sampling.
Contribution
It reframes source separation as an Orderless NADE problem and integrates Gibbs sampling for iterative refinement, a novel approach in this context.
Findings
Significant performance improvements over baseline models.
Effective modeling of source dependencies enhances separation quality.
Iterative Gibbs sampling refines source estimates over multiple steps.
Abstract
We propose a new method for training a supervised source separation system that aims to learn the interdependent relationships between all combinations of sources in a mixture. Rather than independently estimating each source from a mix, we reframe the source separation problem as an Orderless Neural Autoregressive Density Estimator (NADE), and estimate each source from both the mix and a random subset of the other sources. We adapt a standard source separation architecture, Demucs, with additional inputs for each individual source, in addition to the input mixture. We randomly mask these input sources during training so that the network learns the conditional dependencies between the sources. By pairing this training method with a block Gibbs sampling procedure at inference time, we demonstrate that the network can iteratively improve its separation performance by conditioning a source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing
