SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with   Interaural Cue Preservation

Ke Tan; Buye Xu; Anurag Kumar; Eliya Nachmani; Yossi Adi

arXiv:2009.01381·eess.AS·February 3, 2021

SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation

Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi

PDF

1 Repo

TL;DR

This paper introduces SAGRNN, a novel neural network model that improves binaural speaker separation while preserving interaural cues, enhancing sound localization accuracy in noisy environments.

Contribution

It extends gated RNNs with self-attention and dense connectivity for end-to-end binaural separation with cue preservation, a novel approach in the field.

Findings

01

Significantly better separation performance than recent methods.

02

Effective preservation of interaural cues for sound localization.

03

Improved accuracy in localizing speakers in complex environments.

Abstract

Most existing deep learning based binaural speaker separation systems focus on producing a monaural estimate for each of the target speakers, and thus do not preserve the interaural cues, which are crucial for human listeners to perform sound localization and lateralization. In this study, we address talker-independent binaural speaker separation with interaural cues preserved in the estimated binaural signals. Specifically, we extend a newly-developed gated recurrent neural network for monaural separation by additionally incorporating self-attention mechanisms and dense connectivity. We develop an end-to-end multiple-input multiple-output system, which directly maps from the binaural waveform of the mixture to those of the speech signals. The experimental results show that our proposed approach achieves significantly better separation performance than a recent binaural separation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JupiterEthan/sagrnn.github.io
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.