Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function
Xiaofei Li, Laurent Girin, Fabien Badeig, Radu Horaud

TL;DR
This paper presents a novel method for localizing speech sources with a robot head in reverberant environments by estimating the direct-path relative transfer function from noisy signals, improving accuracy over existing methods.
Contribution
The authors introduce a new approach to estimate the direct-path relative transfer function in noisy, reverberant settings, enhancing sound-source localization for robots.
Findings
Outperforms two state-of-the-art SSL methods in experiments
Effective in various reverberant environments
Uses DP-RTF as a robust feature for localization
Abstract
This paper addresses the problem of sound-source localization (SSL) with a robot head, which remains a challenge in real-world environments. In particular we are interested in locating speech sources, as they are of high interest for human-robot interaction. The microphone-pair response corresponding to the direct-path sound propagation is a function of the source direction. In practice, this response is contaminated by noise and reverberations. The direct-path relative transfer function (DP-RTF) is defined as the ratio between the direct-path acoustic transfer function (ATF) of the two microphones, and it is an important feature for SSL. We propose a method to estimate the DP-RTF from noisy and reverberant signals in the short-time Fourier transform (STFT) domain. First, the convolutive transfer function (CTF) approximation is adopted to accurately represent the impulse response of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
