Robust Tuning Datasets for Statistical Machine Translation

Preslav Nakov; Stephan Vogel

arXiv:1710.00346·cs.CL·October 3, 2017

Robust Tuning Datasets for Statistical Machine Translation

Preslav Nakov, Stephan Vogel

PDF

Open Access

TL;DR

This paper proposes automatically selecting subsets of sentences for tuning SMT systems to improve robustness and efficiency, demonstrated by faster tuning and BLEU score improvements using sentence length-based selection.

Contribution

It introduces a novel method of creating tuning datasets by selecting sentence subsets based on length to enhance SMT hyper-parameter robustness and tuning efficiency.

Findings

01

Two-fold tuning speedup achieved.

02

BLEU score improvements comparable to existing methods.

03

Effective subset selection based on sentence length.

Abstract

We explore the idea of automatically crafting a tuning dataset for Statistical Machine Translation (SMT) that makes the hyper-parameters of the SMT system more robust with respect to some specific deficiencies of the parameter tuning algorithms. This is an under-explored research direction, which can allow better parameter tuning. In this paper, we achieve this goal by selecting a subset of the available sentence pairs, which are more suitable for specific combinations of optimizers, objective functions, and evaluation measures. We demonstrate the potential of the idea with the pairwise ranking optimization (PRO) optimizer, which is known to yield too short translations. We show that the learning problem can be alleviated by tuning on a subset of the development set, selected based on sentence length. In particular, using the longest 50% of the tuning sentences, we achieve two-fold…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems