Asynchronous Bidirectional Decoding for Neural Machine Translation
Xiangwen Zhang, Jinsong Su, Yue Qin, Yang Liu, Rongrong Ji, Hongji, Wang

TL;DR
This paper introduces an asynchronous bidirectional decoding architecture for neural machine translation, leveraging both forward and backward target-side contexts to enhance translation quality significantly.
Contribution
It proposes a novel bidirectional decoding framework that fully exploits source and target contexts, improving translation performance over traditional unidirectional models.
Findings
Achieved 3.14 BLEU point improvement on Chinese-English translation.
Achieved 1.38 BLEU point improvement on English-German translation.
Demonstrated effectiveness of bidirectional decoding in NMT.
Abstract
The dominant neural machine translation (NMT) models apply unified attentional encoder-decoder neural networks for translation. Traditionally, the NMT decoders adopt recurrent neural networks (RNNs) to perform translation in a left-toright manner, leaving the target-side contexts generated from right to left unexploited during translation. In this paper, we equip the conventional attentional encoder-decoder NMT framework with a backward decoder, in order to explore bidirectional decoding for NMT. Attending to the hidden state sequence produced by the encoder, our backward decoder first learns to generate the target-side hidden state sequence from right to left. Then, the forward decoder performs translation in the forward direction, while in each translation prediction timestep, it simultaneously applies two attention models to consider the source-side and reverse target-side hidden…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques
