SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer
Diego Di Carlo (RIKEN AIP), Mathieu Fontaine (LTCI, IP Paris), Aditya Arie Nugraha (RIKEN AIP), Yoshiaki Bando (RIKEN AIP), Kazuyoshi Yoshii

TL;DR
SHAMaNS introduces a hybrid sound source localization method combining alpha-stable spatial modeling with neural network-based steering vector interpolation, improving accuracy in multi-source scenarios.
Contribution
The paper presents a novel SSL approach that integrates an alpha-stable spatial measure with a physics-informed neural network for steering vector interpolation, enhancing robustness.
Findings
Outperforms state-of-the-art SSL methods in multi-source environments.
Leverages alpha-stable models to handle non-Gaussian noise.
Uses neural networks to interpolate steering vectors for better DOA estimation.
Abstract
This paper describes a sound source localization (SSL) technique that combines an -stable model for the observed signal with a neural network-based approach for modeling steering vectors. Specifically, a physics-informed neural network, referred to as Neural Steerer, is used to interpolate measured steering vectors (SVs) on a fixed microphone array. This allows for a more robust estimation of the so-called -stable spatial measure, which represents the most plausible direction of arrival (DOA) of a target signal. As an -stable model for the non-Gaussian case ( (0, 2)) theoretically defines a unique spatial measure, we choose to leverage it to account for residual reconstruction error of the Neural Steerer in the downstream tasks. The objective scores indicate that our proposed technique outperforms state-of-the-art methods in the case of multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Underwater Acoustics Research · Structural Health Monitoring Techniques
