Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry, Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

TL;DR
This paper introduces a neural network model called RNN Encoder-Decoder that learns phrase representations to improve statistical machine translation by capturing semantic and syntactic information.
Contribution
The paper presents a novel RNN Encoder-Decoder model that enhances phrase representation learning for machine translation, improving translation quality.
Findings
Improved translation performance using RNN-based phrase probabilities
The model learns meaningful semantic and syntactic phrase representations
Enhancement of existing translation models with RNN-derived features
Abstract
In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
MethodsGated Recurrent Unit
