Low-power SNN-based audio source localisation using a Hilbert Transform   spike encoding scheme

Saeid Haghighatshoar; Dylan R Muir

arXiv:2402.11748·cs.SD·February 13, 2025·1 cites

Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme

Saeid Haghighatshoar, Dylan R Muir

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel low-power sound source localization method using spiking neural networks and Hilbert transform, achieving high accuracy with significantly reduced power consumption for IoT applications.

Contribution

The paper introduces a Hilbert transform-based encoding scheme and a new event-based phase encoding method for efficient SNN-based audio localization, outperforming existing low-power approaches.

Findings

01

Achieves state-of-the-art accuracy comparable to traditional beamforming.

02

Demonstrates significantly lower power consumption on low-power hardware.

03

Improves efficiency of DSP-based signal processing using the proposed method.

Abstract

Sound source localisation is used in many consumer devices, to isolate audio from individual speakers and reject noise. Localization is frequently accomplished by ``beamforming'', which combines phase-shifted audio streams to increase power from chosen source directions, under a known microphone array geometry. Dense band-pass filters are often needed to obtain narrowband signal components from wideband audio. These approaches achieve high accuracy, but narrowband beamforming is computationally demanding, and not ideal for low-power IoT devices. We demonstrate a novel method for sound source localisation on arbitrary microphone arrays, designed for efficient implementation in ultra-low-power spiking neural networks (SNNs). We use a Hilbert transform to avoid dense band-pass filters, and introduce a new event-based encoding method that captures the phase of the complex analytic signal.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

synsense/haghighatshoarmuir2024
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis

MethodsSparse Evolutionary Training · Spiking Neural Networks