Adaptive high-precision sound source localization at low frequencies   based on convolutional neural network

Wenbo Ma; Yan Lu; Yijun Liu

arXiv:2409.20031·cs.SD·October 1, 2024

Adaptive high-precision sound source localization at low frequencies based on convolutional neural network

Wenbo Ma, Yan Lu, Yijun Liu

PDF

Open Access

TL;DR

This paper introduces an adaptive convolutional neural network method for high-precision sound source localization at low frequencies, outperforming classical algorithms in accuracy, adaptability, and robustness.

Contribution

It presents a novel CNN-based SSL approach that is adaptive for frequencies below 1kHz, addressing limitations of traditional beamforming methods.

Findings

01

Significantly improved low-frequency localization accuracy.

02

Effective under varying numbers of sound sources and array-to-grid distances.

03

Robust performance demonstrated under different SNR conditions.

Abstract

Sound source localization (SSL) technology plays a crucial role in various application areas such as fault diagnosis, speech separation, and vibration noise reduction. Although beamforming algorithms are widely used in SSL, their resolution at low frequencies is limited. In recent years, deep learning-based SSL methods have significantly improved their accuracy by employing large microphone arrays and training case specific neural networks, however, this could lead to narrow applicability. To address these issues, this paper proposes a convolutional neural network-based method for high-precision SSL, which is adaptive in the lower frequency range under 1kHz with varying numbers of sound sources and microphone array-to-scanning grid distances. It takes the pressure distribution on a relatively small microphone array as input to the neural network, and employs customized training labels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Flow Measurement and Analysis · Speech Recognition and Synthesis