LSTM Benchmarks for Deep Learning Frameworks

Stefan Braun

arXiv:1806.01818·cs.LG·June 6, 2018·19 cites

LSTM Benchmarks for Deep Learning Frameworks

Stefan Braun

PDF

Open Access 1 Repo

TL;DR

This paper benchmarks various LSTM implementations across multiple deep learning frameworks, comparing performance in speech recognition scenarios to guide optimal choice for specific tasks.

Contribution

It provides comprehensive performance benchmarks for LSTM units across frameworks and configurations, including different hardware and software versions.

Findings

01

cuDNN LSTMs outperform other implementations in speed

02

Performance varies significantly between frameworks and hardware

03

Fused LSTM variants offer a good balance of speed and flexibility

Abstract

This study provides benchmarks for different implementations of LSTM units between the deep learning frameworks PyTorch, TensorFlow, Lasagne and Keras. The comparison includes cuDNN LSTMs, fused LSTM variants and less optimized, but more flexible LSTM implementations. The benchmarks reflect two typical scenarios for automatic speech recognition, notably continuous speech recognition and isolated digit recognition. These scenarios cover input sequences of fixed and variable length as well as the loss functions CTC and cross entropy. Additionally, a comparison between four different PyTorch versions is included. The code is available online https://github.com/stefbraun/rnn_benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stefbraun/rnn_benchmarks
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory