Incremental Averaging Method to Improve Graph-Based Time-Difference-of-Arrival Estimation
Klaus Br\"umann, Kouei Yamaoka, Nobutaka Ono, Simon Doclo

TL;DR
This paper introduces an incremental averaging approach for GCC-PHAT functions in graph-based TDOA estimation, enhancing accuracy in noisy, reverberant environments by leveraging multiple spectral density averages.
Contribution
It proposes a novel incremental averaging method for GCC-PHAT functions that improves TDOA and source localization accuracy under challenging acoustic conditions.
Findings
Improved TDOA estimation accuracy in noisy environments.
Enhanced 2D source localization precision.
Robust performance across various microphone configurations.
Abstract
Estimating the position of a speech source based on time-differences-of-arrival (TDOAs) is often adversely affected by background noise and reverberation. A popular method to estimate the TDOA between a microphone pair involves maximizing a generalized cross-correlation with phase transform (GCC-PHAT) function. Since the TDOAs across different microphone pairs satisfy consistency relations, generally only a small subset of microphone pairs are used for source position estimation. Although the set of microphone pairs is often determined based on a reference microphone, recently a more robust method has been proposed to determine the set of microphone pairs by computing the minimum spanning tree (MST) of a signal graph of GCC-PHAT function reliabilities. To reduce the influence of noise and reverberation on the TDOA estimation accuracy, in this paper we propose to compute the GCC-PHAT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Indoor and Outdoor Localization Technologies · Music and Audio Processing
