LSTM Benchmarks for Deep Learning Frameworks
Stefan Braun

TL;DR
This paper benchmarks various LSTM implementations across multiple deep learning frameworks, comparing performance in speech recognition scenarios to guide optimal choice for specific tasks.
Contribution
It provides comprehensive performance benchmarks for LSTM units across frameworks and configurations, including different hardware and software versions.
Findings
cuDNN LSTMs outperform other implementations in speed
Performance varies significantly between frameworks and hardware
Fused LSTM variants offer a good balance of speed and flexibility
Abstract
This study provides benchmarks for different implementations of LSTM units between the deep learning frameworks PyTorch, TensorFlow, Lasagne and Keras. The comparison includes cuDNN LSTMs, fused LSTM variants and less optimized, but more flexible LSTM implementations. The benchmarks reflect two typical scenarios for automatic speech recognition, notably continuous speech recognition and isolated digit recognition. These scenarios cover input sequences of fixed and variable length as well as the loss functions CTC and cross entropy. Additionally, a comparison between four different PyTorch versions is included. The code is available online https://github.com/stefbraun/rnn_benchmarks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
