Leveraging Sound Source Trajectories for Universal Sound Separation
Donghang Wu, Xihong Wu, Tianshu Qu

TL;DR
This paper introduces a novel sound separation method that iteratively refines source localization and separation for moving sources, significantly improving accuracy in reverberant environments.
Contribution
It proposes a mutual facilitation framework that iteratively enhances both sound source tracking and separation, addressing limitations of prior methods requiring prior localization knowledge.
Findings
Achieves more accurate separation with moving sources.
Refined tracking improves separation performance.
Effective in reverberant conditions.
Abstract
Existing methods utilizing spatial information for sound source separation require prior knowledge of the direction of arrival (DOA) of the source or utilize estimated but imprecise localization results, which impairs the separation performance, especially when the sound sources are moving. In fact, sound source localization and separation are interconnected problems, that is, sound source localization facilitates sound separation while sound separation contributes to refined source localization. This paper proposes a method utilizing the mutual facilitation mechanism between sound source localization and separation for moving sources. The proposed method comprises three stages. The first stage is initial tracking, which tracks each sound source from the audio mixture based on the source signal envelope estimation. These tracking results may lack sufficient accuracy. The second stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Acoustic Wave Phenomena Research · Music Technology and Sound Studies
