Adaptive high-precision sound source localization at low frequencies based on convolutional neural network
Wenbo Ma, Yan Lu, Yijun Liu

TL;DR
This paper introduces an adaptive convolutional neural network method for high-precision sound source localization at low frequencies, outperforming classical algorithms in accuracy, adaptability, and robustness.
Contribution
It presents a novel CNN-based SSL approach that is adaptive for frequencies below 1kHz, addressing limitations of traditional beamforming methods.
Findings
Significantly improved low-frequency localization accuracy.
Effective under varying numbers of sound sources and array-to-grid distances.
Robust performance demonstrated under different SNR conditions.
Abstract
Sound source localization (SSL) technology plays a crucial role in various application areas such as fault diagnosis, speech separation, and vibration noise reduction. Although beamforming algorithms are widely used in SSL, their resolution at low frequencies is limited. In recent years, deep learning-based SSL methods have significantly improved their accuracy by employing large microphone arrays and training case specific neural networks, however, this could lead to narrow applicability. To address these issues, this paper proposes a convolutional neural network-based method for high-precision SSL, which is adaptive in the lower frequency range under 1kHz with varying numbers of sound sources and microphone array-to-scanning grid distances. It takes the pressure distribution on a relatively small microphone array as input to the neural network, and employs customized training labels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Flow Measurement and Analysis · Speech Recognition and Synthesis
