Lexical Translation Model Using a Deep Neural Network Architecture

Thanh-Le Ha; Jan Niehues; Alex Waibel

arXiv:1504.07395·cs.CL·April 29, 2015·1 cites

Lexical Translation Model Using a Deep Neural Network Architecture

Thanh-Le Ha, Jan Niehues, Alex Waibel

PDF

Open Access

TL;DR

This paper introduces a neural network-based lexical translation model that leverages global source context and shared parameters to improve translation quality, reducing data sparsity issues and achieving up to 0.5 BLEU point improvements.

Contribution

It presents a novel deep neural network architecture for lexical translation that integrates global context and shared parameters, enhancing translation performance over previous models.

Findings

01

Achieved up to 0.5 BLEU point improvement on TED translation tasks.

02

Effectively reduces data sparsity through shared parameters.

03

Leverages non-linear dependencies between source words.

Abstract

In this paper we combine the advantages of a model using global source sentence contexts, the Discriminative Word Lexicon, and neural networks. By using deep neural networks instead of the linear maximum entropy model in the Discriminative Word Lexicon models, we are able to leverage dependencies between different source words due to the non-linearity. Furthermore, the models for different target words can share parameters and therefore data sparsity problems are effectively reduced. By using this approach in a state-of-the-art translation system, we can improve the performance by up to 0.5 BLEU points for three different language pairs on the TED translation task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques