Distance-Based Sound Separation

Katharine Patterson; Kevin Wilson; Scott Wisdom; John R. Hershey

arXiv:2207.00562·cs.SD·July 4, 2022

Distance-Based Sound Separation

Katharine Patterson, Kevin Wilson, Scott Wisdom, John R. Hershey

PDF

Open Access

TL;DR

This paper introduces distance-based sound separation, a new task where sounds are separated based solely on their proximity to a microphone, aiding focus in noisy environments.

Contribution

It presents a neural network approach for separating near and far sounds using distance as the criterion, demonstrating feasibility in synthetic reverberant conditions.

Findings

01

Achieved 4.4 dB SI-SNR improvement for near sounds

02

Achieved 6.8 dB SI-SNR improvement for far sounds

03

Validated the approach with a single microphone and multiple speakers

Abstract

We propose the novel task of distance-based sound separation, where sounds are separated based only on their distance from a single microphone. In the context of assisted listening devices, proximity provides a simple criterion for sound selection in noisy environments that would allow the user to focus on sounds relevant to a local conversation. We demonstrate the feasibility of this approach by training a neural network to separate near sounds from far sounds in single channel synthetic reverberant mixtures, relative to a threshold distance defining the boundary between near and far. With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Blind Source Separation Techniques