Variational Neural Machine Translation
Biao Zhang, Deyi Xiong, Jinsong Su, Hong Duan, Min Zhang

TL;DR
This paper introduces a variational neural machine translation model that incorporates a continuous latent variable to better capture source sentence semantics, leading to improved translation quality over traditional encoder-decoder models.
Contribution
The paper proposes a novel variational encoder-decoder framework for neural machine translation, enabling explicit semantic modeling and end-to-end training with efficient inference.
Findings
Significant improvements in Chinese-English translation accuracy.
Enhanced translation quality for English-German tasks.
Effective use of a neural posterior approximator with reparameterization.
Abstract
Models of neural machine translation are often from a discriminative family of encoderdecoders that learn a conditional distribution of a target sentence given a source sentence. In this paper, we propose a variational model to learn this conditional distribution for neural machine translation: a variational encoderdecoder model that can be trained end-to-end. Different from the vanilla encoder-decoder model that generates target translations from hidden representations of source sentences alone, the variational model introduces a continuous latent variable to explicitly model underlying semantics of source sentences and to guide the generation of target translations. In order to perform efficient posterior inference and large-scale training, we build a neural posterior approximator conditioned on both the source and the target sides, and equip it with a reparameterization technique to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
