T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Shu Wang, Yuhuang Hu, Shih-Chii Liu

TL;DR
This paper introduces T-NGA, a self-supervised algorithm that enables training neural networks on spiking audio sensor data without large labeled datasets, achieving near-supervised accuracy in speech recognition tasks.
Contribution
The paper presents a novel self-supervised method for training neural networks on spiking cochlea data by grafting pretrained recurrent networks, bypassing the need for large labeled datasets.
Findings
Grafted networks achieve similar accuracy to supervised networks on speech recognition.
T-NGA maintains high accuracy despite silicon cochlea non-idealities.
Method reduces reliance on large labeled spike datasets.
Abstract
Spiking silicon cochlea sensors encode sound as an asynchronous stream of spikes from different frequency channels. The lack of labeled training datasets for spiking cochleas makes it difficult to train deep neural networks on the outputs of these sensors. This work proposes a self-supervised method called Temporal Network Grafting Algorithm (T-NGA), which grafts a recurrent network pretrained on spectrogram features so that the network works with the cochlea event features. T-NGA training requires only temporally aligned audio spectrograms and event features. Our experiments show that the accuracy of the grafted network was similar to the accuracy of a supervised network trained from scratch on a speech recognition task using events from a software spiking cochlea model. Despite the circuit non-idealities of the spiking silicon cochlea, the grafted network accuracy on the silicon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Hearing, Cochlea, Tinnitus, Genetics
