Separake: Source Separation with a Little Help From Echoes
Robin Scheibler, Diego Di Carlo, Antoine Deleforge, Ivan Dokmani\'c

TL;DR
This paper demonstrates that multipath echoes can enhance sound source separation by providing additional spatial diversity, even without full room impulse response estimation, leading to improved performance over traditional methods.
Contribution
It introduces a novel approach that leverages known virtual microphone positions created by echoes to improve source separation, challenging the common belief that multipath is solely detrimental.
Findings
Echoes improve separation performance in standard algorithms.
Using echoes enables separation with magnitude-only information.
Multichannel NMF with echoes outperforms the basic variant.
Abstract
It is commonly believed that multipath hurts various audio processing algorithms. At odds with this belief, we show that multipath in fact helps sound source separation, even with very simple propagation models. Unlike most existing methods, we neither ignore the room impulse responses, nor we attempt to estimate them fully. We rather assume that we know the positions of a few virtual microphones generated by echoes and we show how this gives us enough spatial diversity to get a performance boost over the anechoic case. We show improvements for two standard algorithms---one that uses only magnitudes of the transfer functions, and one that also uses the phases. Concretely, we show that multichannel non-negative matrix factorization aided with a small number of echoes beats the vanilla variant of the same algorithm, and that with magnitude information only, echoes enable separation where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
