Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi,, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff, Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, {\L}ukasz Kaiser,, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa

TL;DR
This paper introduces Google's Neural Machine Translation system (GNMT), which significantly improves translation quality and efficiency by addressing computational costs, rare word handling, and inference speed issues in neural translation models.
Contribution
The paper presents a deep LSTM-based NMT model with attention, residual connections, sub-word units, and optimized inference techniques, advancing practical neural translation systems.
Findings
Achieves competitive results on WMT'14 benchmarks.
Reduces translation errors by 60% compared to phrase-based systems.
Employs low-precision arithmetic for faster inference.
Abstract
Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NMT's use in practical deployments and services, where both accuracy and speed are essential. In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues. Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · WordPiece · Long Short-Term Memory
