Semi-supervised source localization in reverberant environments with deep generative modeling
Michael J. Bianco, Sharon Gannot, Efren Fernandez-Grande, and Peter, Gerstoft

TL;DR
This paper introduces a semi-supervised deep generative model for acoustic source localization in reverberant environments, effectively leveraging limited labeled data to improve localization accuracy and interpretability.
Contribution
It presents the first deep generative modeling approach for physical acoustic propagation, combining semi-supervised learning with VAEs for source localization.
Findings
VAE-SSL outperforms traditional methods in label-limited scenarios.
The approach can generate new RTF-phase samples, demonstrating understanding of acoustic physics.
VAE-SSL achieves better localization accuracy than CNNs and classical algorithms.
Abstract
We propose a semi-supervised approach to acoustic source localization in reverberant environments based on deep generative modeling. Localization in reverberant environments remains an open challenge. Even with large data volumes, the number of labels available for supervised learning in reverberant environments is usually small. We address this issue by performing semi-supervised learning (SSL) with convolutional variational autoencoders (VAEs) on reverberant speech signals recorded with microphone arrays. The VAE is trained to generate the phase of relative transfer functions (RTFs) between microphones, in parallel with a direction of arrival (DOA) classifier based on RTF-phase. These models are trained using both labeled and unlabeled RTF-phase sequences. In learning to perform these tasks, the VAE-SSL explicitly learns to separate the physical causes of the RTF-phase (i.e., source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsUSD Coin Customer Service Number +1-833-534-1729
