A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation
Kuan-Lin Chen, Ching-Hua Lee, Bhaskar D. Rao, Harinath, Garudadri

TL;DR
This paper introduces a robust DNN-based wideband DoA estimation method using a normalized T-F weighted criterion that enhances accuracy in noisy, reverberant environments without eigendecomposition.
Contribution
The paper proposes a novel normalized T-F weighted criterion for DNN-based DoA estimation that improves robustness against interference and outperforms existing methods.
Findings
Outperforms popular DNN-based DoA methods in noisy environments
Duplicating Hadamard product of speech masks is highly effective
No eigendecomposition required for the proposed method
Abstract
Deep neural networks (DNNs) have greatly benefited direction of arrival (DoA) estimation methods for speech source localization in noisy environments. However, their localization accuracy is still far from satisfactory due to the vulnerability to nonspeech interference. To improve the robustness against interference, we propose a DNN based normalized time-frequency (T-F) weighted criterion which minimizes the distance between the candidate steering vectors and the filtered snapshots in the T-F domain. Our method requires no eigendecomposition and uses a simple normalization to prevent the optimization objective from being misled by noisy filtered snapshots. We also study different designs of T-F weights guided by a DNN. We find that duplicating the Hadamard product of speech ratio masks is highly effective and better than other techniques such as direct masking and taking the mean in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Underwater Acoustics Research · Indoor and Outdoor Localization Technologies
