Google's Neural Machine Translation System: Bridging the Gap between   Human and Machine Translation

Yonghui Wu; Mike Schuster; Zhifeng Chen; Quoc V. Le; Mohammad Norouzi,; Wolfgang Macherey; Maxim Krikun; Yuan Cao; Qin Gao; Klaus Macherey; Jeff; Klingner; Apurva Shah; Melvin Johnson; Xiaobing Liu; {\L}ukasz Kaiser,; Stephan Gouws; Yoshikiyo Kato; Taku Kudo; Hideto Kazawa; Keith Stevens,; George Kurian; Nishant Patil; Wei Wang; Cliff Young; Jason Smith; Jason; Riesa; Alex Rudnick; Oriol Vinyals; Greg Corrado; Macduff Hughes; Jeffrey; Dean

arXiv:1609.08144·cs.CL·October 11, 2016·5.7k cites

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi,, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff, Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, {\L}ukasz Kaiser,, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa

PDF

Open Access 5 Repos 1 Models

TL;DR

This paper introduces Google's Neural Machine Translation system (GNMT), which significantly improves translation quality and efficiency by addressing computational costs, rare word handling, and inference speed issues in neural translation models.

Contribution

The paper presents a deep LSTM-based NMT model with attention, residual connections, sub-word units, and optimized inference techniques, advancing practical neural translation systems.

Findings

01

Achieves competitive results on WMT'14 benchmarks.

02

Reduces translation errors by 60% compared to phrase-based systems.

03

Employs low-precision arithmetic for faster inference.

Abstract

Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NMT's use in practical deployments and services, where both accuracy and speed are essential. In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues. Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
monsoon-nlp/bert-base-thai
model· 586 dl· ♡ 13
586 dl♡ 13

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · WordPiece · Long Short-Term Memory