Improving speech recognition by revising gated recurrent units

Mirco Ravanelli; Philemon Brakel; Maurizio Omologo; Yoshua Bengio

arXiv:1710.00641·cs.CL·October 3, 2017

Improving speech recognition by revising gated recurrent units

Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

PDF

1 Repo

TL;DR

This paper proposes a simplified GRU architecture for speech recognition by removing the reset gate and using ReLU activations, leading to faster training and improved accuracy across various conditions.

Contribution

It introduces a novel, more efficient GRU variant tailored for speech recognition, enhancing performance and reducing training time.

Findings

01

Training time reduced by over 30%

02

Consistent improvement in recognition accuracy

03

Effective across different tasks and noisy environments

Abstract

Speech recognition is largely taking advantage of deep learning, showing that substantial benefits can be obtained by modern Recurrent Neural Networks (RNNs). The most popular RNNs are Long Short-Term Memory (LSTMs), which typically reach state-of-the-art performance in many tasks thanks to their ability to learn long-term dependencies and robustness to vanishing gradients. Nevertheless, LSTMs have a rather complex design with three multiplicative gates, that might impair their efficient implementation. An attempt to simplify LSTMs has recently led to Gated Recurrent Units (GRUs), which are based on just two multiplicative gates. This paper builds on these efforts by further revising GRUs and proposing a simplified architecture potentially more suitable for speech recognition. The contribution of this work is two-fold. First, we suggest to remove the reset gate in the GRU design,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mravanelli/theano-kaldi-rnn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Gated Recurrent Unit