Interpretable Binaural Deep Beamforming Guided by Time-Varying Relative Transfer Function
Ilai Zaidel, Sharon Gannot

TL;DR
This paper introduces a deep learning-based binaural beamforming method guided by time-varying relative transfer functions, improving speech enhancement in dynamic environments while preserving spatial cues for hearables.
Contribution
It presents a novel deep beamforming framework that utilizes continuously tracked RTFs for dynamic target speech enhancement and extends to binaural configurations with realistic spatial rendering.
Findings
RTF guidance produces smoother, more consistent beampatterns.
The binaural system effectively preserves spatial cues like ILD and ITD.
The approach outperforms unguided models in dynamic acoustic scenarios.
Abstract
In this work, we propose a deep beamforming framework for speech enhancement in dynamic acoustic environments. The framework learns time-varying beamformer weights from noisy multichannel signals via a deep neural network, guided by a continuously tracked relative transfer function (RTF) of a moving target speaker. We analyze the network's spatial behavior on an 8-microphone linear array by evaluating narrowband and wideband beampatterns in three modes: (i) oracle guidance with true RTFs, (ii) guidance with subspace-tracked RTF estimates, and (iii) operation without RTF guidance. Results show that RTF guidance yields smoother, more spatially consistent beampatterns that track the target direction of arrival (DOA), whereas the unguided model fails to maintain a clear spatial focus. We further extend the framework to binaural beamforming for dynamic target-speaker enhancement. The system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Aerodynamics and Acoustics in Jet Flows
