Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Oleksii Kuchaiev; Boris Ginsburg; Igor Gitman; Vitaly Lavrukhin; Jason; Li; Huyen Nguyen; Carl Case; Paulius Micikevicius

arXiv:1805.10387·cs.CL·November 22, 2018·41 cites

Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Jason, Li, Huyen Nguyen, Carl Case, Paulius Micikevicius

PDF

Open Access 3 Repos

TL;DR

OpenSeq2Seq is a TensorFlow toolkit enabling efficient mixed-precision training for sequence-to-sequence models, achieving state-of-the-art results in NLP and speech recognition with significantly reduced training time.

Contribution

It introduces a versatile, distributed, mixed-precision training toolkit that accelerates training and improves performance across various sequence-to-sequence tasks.

Findings

01

Achieves 1.5-3x faster training times.

02

Provides state-of-the-art performance on translation and speech recognition.

03

Supports diverse sequence-to-sequence applications.

Abstract

We present OpenSeq2Seq - a TensorFlow-based toolkit for training sequence-to-sequence models that features distributed and mixed-precision training. Benchmarks on machine translation and speech recognition tasks show that models built using OpenSeq2Seq give state-of-the-art performance at 1.5-3x less training time. OpenSeq2Seq currently provides building blocks for models that solve a wide range of tasks including neural machine translation, automatic speech recognition, and speech synthesis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis