Sequence-to-sequence models in peer-to-peer learning: A practical application
Robert \v{S}ajina, Ivo Ip\v{s}i\'c

TL;DR
This study evaluates sequence-to-sequence LSTM models for automatic speech recognition in peer-to-peer learning environments, comparing decentralized peer-to-peer setups with traditional centralized training, and finds feasible performance with slightly higher error rates.
Contribution
It demonstrates the practical application of Seq2Seq models in decentralized peer-to-peer learning for ASR, highlighting their performance relative to centralized training.
Findings
Peer-to-peer learning yields WER of 87-92% on UserLibri dataset.
Centralized training achieves WER of 84% on UserLibri.
Feasibility of Seq2Seq models in decentralized ASR tasks is confirmed.
Abstract
This paper explores the applicability of sequence-to-sequence (Seq2Seq) models based on LSTM units for Automatic Speech Recognition (ASR) task within peer-to-peer learning environments. Leveraging two distinct peer-to-peer learning methods, the study simulates the learning process of agents and evaluates their performance in ASR task using two different ASR datasets. In a centralized training setting, utilizing a scaled-down variant of the Deep Speech 2 model, a single model achieved a Word Error Rate (WER) of 84\% when trained on the UserLibri dataset, and 38\% when trained on the LJ Speech dataset. Conversely, in a peer-to-peer learning scenario involving 55 agents, the WER ranged from 87\% to 92\% for the UserLibri dataset, and from 52\% to 56\% for the LJ Speech dataset. The findings demonstrate the feasibility of employing Seq2Seq models in decentralized settings, albeit with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
