Feature-based Decipherment for Large Vocabulary Machine Translation

Iftekhar Naim; Daniel Gildea

arXiv:1508.02142·cs.CL·August 11, 2015·2 cites

Open Access

TL;DR

This paper introduces a log-linear decipherment model with latent variables that exploits orthographic similarities between languages. Approximate inference via MCMC sampling and contrastive divergence lets it scale to large vocabularies, and the orthographic features let it outperform existing generative decipherment models.

Contribution

It presents a log-linear latent-variable model with orthographic similarity features, trained with approximate inference (MCMC sampling and contrastive divergence) so that it remains tractable for large-vocabulary decipherment.

Findings

01 Outperforms existing generative decipherment models

02 Scales effectively to large vocabularies

03 Utilizes orthographic features for better translation accuracy

Abstract

Orthographic similarities across languages provide a strong signal for probabilistic decipherment, especially for closely related language pairs. The existing decipherment models, however, are not well-suited for exploiting these orthographic similarities. We propose a log-linear model with latent variables that incorporates orthographic similarity features. Maximum likelihood training is computationally expensive for the proposed log-linear model. To address this challenge, we perform approximate inference via MCMC sampling and contrastive divergence. Our results show that the proposed log-linear model with contrastive divergence scales to large vocabularies and outperforms the existing generative decipherment models by exploiting the orthographic features.
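To make the idea of an orthographic similarity feature inside a log-linear model concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' implementation. The feature (one minus normalized edit distance), the single scalar weight, the conditional form p(e | f), and the names `edit_distance`, `ortho_similarity`, and `translation_probs` are all hypothetical choices made for this example. The abstract does not specify the paper's actual feature set, and the paper's model uses latent variables rather than the simplified conditional distribution shown here.

```python
import math


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed with a rolling one-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def ortho_similarity(f: str, e: str) -> float:
    """Hypothetical feature: 1 - normalized edit distance, a value in [0, 1]."""
    longest = max(len(f), len(e))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(f, e) / longest


def translation_probs(f: str, candidates: list[str], weight: float) -> dict[str, float]:
    """Log-linear distribution over candidates: p(e | f) proportional to exp(weight * feature).

    Assumption: a single feature and a single weight, for illustration only.
    """
    scores = [weight * ortho_similarity(f, e) for e in candidates]
    top = max(scores)  # subtract the max before exponentiating, for stability
    exps = [math.exp(s - top) for s in scores]
    z = sum(exps)      # normalizer over the candidate set
    return {e: x / z for e, x in zip(candidates, exps)}


if __name__ == "__main__":
    probs = translation_probs("nacion", ["nation", "house", "night"], weight=5.0)
    for word, p in sorted(probs.items(), key=lambda kv: -kv[1]):
        print(f"{word}\t{p:.3f}")
```

The normalizer `z` in this sketch sums over only three candidates. Over a large vocabulary that sum becomes the expensive part of maximum likelihood training, which is the cost the abstract says is addressed with MCMC sampling and contrastive divergence.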

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications