Towards efficient end-to-end speech recognition with biologically-inspired neural networks
Thomas Bohnstingl, Ayush Garg, Stanis{\l}aw Wo\'zniak, George Saon,, Evangelos Eleftheriou, Angeliki Pantazi

TL;DR
This paper introduces biologically-inspired neural network units into end-to-end speech recognition models, achieving competitive accuracy with reduced computational cost and latency by emulating neural dynamics of the human brain.
Contribution
It presents novel neuro-synaptic units inspired by brain dynamics integrated into RNN-T, enhancing biological plausibility and efficiency in large-scale ASR systems.
Findings
Achieved competitive speech recognition accuracy with biologically-inspired models.
Demonstrated reduced computational cost and latency in ASR.
First large-scale biologically realistic ASR implementation.
Abstract
Automatic speech recognition (ASR) is a capability which enables a program to process human speech into a written form. Recent developments in artificial intelligence (AI) have led to high-accuracy ASR systems based on deep neural networks, such as the recurrent neural network transducer (RNN-T). However, the core components and the performed operations of these approaches depart from the powerful biological counterpart, i.e., the human brain. On the other hand, the current developments in biologically-inspired ASR models, based on spiking neural networks (SNNs), lag behind in terms of accuracy and focus primarily on small scale applications. In this work, we revisit the incorporation of biologically-plausible models into deep learning and we substantially enhance their capabilities, by taking inspiration from the diverse neural and synaptic dynamics found in the brain. In particular,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Ferroelectric and Negative Capacitance Devices
