Exploiting an External Microphone for Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
Daniel Fejgin, Simon Doclo

TL;DR
This paper presents a low-complexity binaural DOA estimation method for multiple speakers that leverages an external microphone to improve accuracy in noisy, reverberant environments, reducing computational load.
Contribution
It introduces a novel RTF vector estimation approach using an external microphone with low spatial coherence assumptions, enhancing DOA estimation efficiency.
Findings
Comparable performance to the covariance whitening method
Lower computational complexity
Effective in noisy, reverberant environments
Abstract
In hearing aid applications, an important objective is to accurately estimate the direction of arrival (DOA) of multiple speakers in noisy and reverberant environments. Recently, we proposed a binaural DOA estimation method, where the DOAs of the speakers are estimated by selecting the directions for which the so-called Hermitian angle spectrum between the estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors is maximized. The RTF vector is estimated using the covariance whitening (CW) method, which requires a computationally complex generalized eigenvalue decomposition. The spatial spectrum is obtained by only considering frequencies where it is likely that one speaker dominates over the other speakers, noise and reverberation. In this contribution, we exploit the availability of an external microphone that is spatially separated from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Direction-of-Arrival Estimation Techniques · Advanced Adaptive Filtering Techniques
