Graph neural networks for sound source localization on distributed   microphone networks

Eric Grinstein; Mike Brookes; Patrick A. Naylor

arXiv:2306.16081·cs.SD·June 29, 2023

Graph neural networks for sound source localization on distributed microphone networks

Eric Grinstein, Mike Brookes, Patrick A. Naylor

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Graph Neural Network-based method for sound source localization on distributed microphone networks, effectively handling variable input channels and outperforming classical algorithms in experiments.

Contribution

The paper proposes a novel GNN-based localization method that adapts to varying microphone counts, bridging classical SSL algorithms with modern graph neural network techniques.

Findings

01

Outperforms classical SSL baselines in experiments

02

Handles variable number of microphones effectively

03

Uses Relation Network GNN for sound source localization

Abstract

Distributed Microphone Arrays (DMAs) present many challenges with respect to centralized microphone arrays. An important requirement of applications on these arrays is handling a variable number of input channels. We consider the use of Graph Neural Networks (GNNs) as a solution to this challenge. We present a localization method using the Relation Network GNN, which we show shares many similarities to classical signal processing algorithms for Sound Source Localization (SSL). We apply our method for the task of SSL and validate it experimentally using an unseen number of microphones. We test different feature extractors and show that our approach significantly outperforms classical baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

egrinstein/gnn_ssl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Music Technology and Sound Studies