Distance-Based Sound Separation
Katharine Patterson, Kevin Wilson, Scott Wisdom, John R. Hershey

TL;DR
This paper introduces distance-based sound separation, a new task where sounds are separated based solely on their proximity to a microphone, aiding focus in noisy environments.
Contribution
It presents a neural network approach for separating near and far sounds using distance as the criterion, demonstrating feasibility in synthetic reverberant conditions.
Findings
Achieved 4.4 dB SI-SNR improvement for near sounds
Achieved 6.8 dB SI-SNR improvement for far sounds
Validated the approach with a single microphone and multiple speakers
Abstract
We propose the novel task of distance-based sound separation, where sounds are separated based only on their distance from a single microphone. In the context of assisted listening devices, proximity provides a simple criterion for sound selection in noisy environments that would allow the user to focus on sounds relevant to a local conversation. We demonstrate the feasibility of this approach by training a neural network to separate near sounds from far sounds in single channel synthetic reverberant mixtures, relative to a threshold distance defining the boundary between near and far. With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques
