Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
Saeid Haghighatshoar, Dylan R Muir

TL;DR
This paper presents a novel low-power sound source localization method using spiking neural networks and Hilbert transform, achieving high accuracy with significantly reduced power consumption for IoT applications.
Contribution
The paper introduces a Hilbert transform-based encoding scheme and a new event-based phase encoding method for efficient SNN-based audio localization, outperforming existing low-power approaches.
Findings
Achieves state-of-the-art accuracy comparable to traditional beamforming.
Demonstrates significantly lower power consumption on low-power hardware.
Improves efficiency of DSP-based signal processing using the proposed method.
Abstract
Sound source localisation is used in many consumer devices, to isolate audio from individual speakers and reject noise. Localization is frequently accomplished by ``beamforming'', which combines phase-shifted audio streams to increase power from chosen source directions, under a known microphone array geometry. Dense band-pass filters are often needed to obtain narrowband signal components from wideband audio. These approaches achieve high accuracy, but narrowband beamforming is computationally demanding, and not ideal for low-power IoT devices. We demonstrate a novel method for sound source localisation on arbitrary microphone arrays, designed for efficient implementation in ultra-low-power spiking neural networks (SNNs). We use a Hilbert transform to avoid dense band-pass filters, and introduce a new event-based encoding method that captures the phase of the complex analytic signal.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis
MethodsSparse Evolutionary Training · Spiking Neural Networks
