Leveraging Sound Source Trajectories for Universal Sound Separation

Donghang Wu; Xihong Wu; Tianshu Qu

arXiv:2409.04843·eess.AS·April 8, 2025

Leveraging Sound Source Trajectories for Universal Sound Separation

Donghang Wu, Xihong Wu, Tianshu Qu

PDF

Open Access

TL;DR

This paper introduces a novel sound separation method that iteratively refines source localization and separation for moving sources, significantly improving accuracy in reverberant environments.

Contribution

It proposes a mutual facilitation framework that iteratively enhances both sound source tracking and separation, addressing limitations of prior methods requiring prior localization knowledge.

Findings

01

Achieves more accurate separation with moving sources.

02

Refined tracking improves separation performance.

03

Effective in reverberant conditions.

Abstract

Existing methods utilizing spatial information for sound source separation require prior knowledge of the direction of arrival (DOA) of the source or utilize estimated but imprecise localization results, which impairs the separation performance, especially when the sound sources are moving. In fact, sound source localization and separation are interconnected problems, that is, sound source localization facilitates sound separation while sound separation contributes to refined source localization. This paper proposes a method utilizing the mutual facilitation mechanism between sound source localization and separation for moving sources. The proposed method comprises three stages. The first stage is initial tracking, which tracks each sound source from the audio mixture based on the source signal envelope estimation. These tracking results may lack sufficient accuracy. The second stage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Acoustic Wave Phenomena Research · Music Technology and Sound Studies