Heterogeneous Reservoir Computing Models for Persian Speech Recognition
Zohreh Ansari, Farzin Pourhoseini, Fatemeh Hadaeghi

TL;DR
This paper introduces heterogeneous reservoir computing models for Persian speech recognition, demonstrating improved accuracy and reduced training time compared to standard RC and comparable performance to LSTM models.
Contribution
The paper proposes heterogeneous single and multi-layer ESNs to enhance RC performance in Persian ASR, a novel application for this technology.
Findings
Heterogeneous RC models outperform standard RC in accuracy.
Heterogeneous RC models match LSTM recognition accuracy.
Heterogeneous RC models significantly reduce training time.
Abstract
Over the last decade, deep-learning methods have been gradually incorporated into conventional automatic speech recognition (ASR) frameworks to create acoustic, pronunciation, and language models. Although it led to significant improvements in ASRs' recognition accuracy, due to their hard constraints related to hardware requirements (e.g., computing power and memory usage), it is unclear if such approaches are the most computationally- and energy-efficient options for embedded ASR applications. Reservoir computing (RC) models (e.g., echo state networks (ESNs) and liquid state machines (LSMs)), on the other hand, have been proven inexpensive to train, have vastly fewer parameters, and are compatible with emergent hardware technologies. However, their performance in speech processing tasks is relatively inferior to that of the deep-learning-based models. To enhance the accuracy of the RC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Advanced Memory and Neural Computing · Neural Networks and Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
