Graph neural networks for sound source localization on distributed microphone networks
Eric Grinstein, Mike Brookes, Patrick A. Naylor

TL;DR
This paper introduces a Graph Neural Network-based method for sound source localization on distributed microphone networks, effectively handling variable input channels and outperforming classical algorithms in experiments.
Contribution
The paper proposes a novel GNN-based localization method that adapts to varying microphone counts, bridging classical SSL algorithms with modern graph neural network techniques.
Findings
Outperforms classical SSL baselines in experiments
Handles variable number of microphones effectively
Uses Relation Network GNN for sound source localization
Abstract
Distributed Microphone Arrays (DMAs) present many challenges with respect to centralized microphone arrays. An important requirement of applications on these arrays is handling a variable number of input channels. We consider the use of Graph Neural Networks (GNNs) as a solution to this challenge. We present a localization method using the Relation Network GNN, which we show shares many similarities to classical signal processing algorithms for Sound Source Localization (SSL). We apply our method for the task of SSL and validate it experimentally using an unseen number of microphones. We test different feature extractors and show that our approach significantly outperforms classical baselines.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Music Technology and Sound Studies
