Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits

Julia Kreutzer; David Vilar; Artem Sokolov

arXiv:2110.06997·cs.CL·October 15, 2021

Open Access

TL;DR

This paper introduces a multi-armed bandit approach to dynamically balance multi-faceted training data in machine translation, reducing manual tuning and improving system performance across various multi-facet scenarios.

Contribution

The paper proposes a bandit-based method that automatically optimizes which data facet to train on during MT training, improving adaptability and reducing manual intervention.

Findings

01. Bandit learning achieves competitive MT performance.

02. The approach adapts effectively across multiple facets.

03. Insights into data selection strategies are provided.

Abstract

Training data for machine translation (MT) is often sourced from a multitude of large corpora that are multi-faceted in nature, e.g. containing contents from multiple domains or different levels of quality or complexity. Naturally, these facets do not occur with equal frequency, nor are they equally important for the test scenario at hand. In this work, we propose to optimize this balance jointly with MT model parameters to relieve system developers from manual schedule design. A multi-armed bandit is trained to dynamically choose between facets in a way that is most beneficial for the MT system. We evaluate it on three different multi-facet applications: balancing translationese and natural training data, or data from multiple domains or multiple language pairs. We find that bandit learning leads to competitive MT systems across tasks, and our analysis provides insights into its…
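
The idea of a bandit choosing between facets during training can be illustrated with a minimal sketch. It uses EXP3, a standard adversarial bandit algorithm, as an assumption: the abstract does not say which bandit algorithm or reward signal the authors use. The class `Exp3FacetSampler`, the function `simulated_reward`, and the reward means are hypothetical; in a real setup the reward would come from the MT model (for example, a change in development-set loss after a training step on the chosen facet).

```python
import math
import random


class Exp3FacetSampler:
    """EXP3 bandit whose arms are data facets. Assumes rewards lie in [0, 1]."""

    def __init__(self, facets, gamma=0.1, seed=0):
        self.facets = list(facets)
        self.gamma = gamma  # exploration rate
        self.weights = [1.0] * len(self.facets)
        self.rng = random.Random(seed)

    def probabilities(self):
        # Mix the weight-proportional distribution with uniform exploration.
        total = sum(self.weights)
        k = len(self.facets)
        return [(1 - self.gamma) * w / total + self.gamma / k for w in self.weights]

    def choose(self):
        probs = self.probabilities()
        idx = self.rng.choices(range(len(self.facets)), weights=probs, k=1)[0]
        return idx, probs[idx]

    def update(self, idx, prob, reward):
        # Importance-weighted reward estimate for the arm that was played.
        estimate = reward / prob
        self.weights[idx] *= math.exp(self.gamma * estimate / len(self.facets))
        # Rescale by the largest weight; this leaves the probabilities unchanged
        # and keeps the weights from overflowing.
        largest = max(self.weights)
        self.weights = [w / largest for w in self.weights]


def simulated_reward(facet, rng):
    # Hypothetical stand-in for a model-based reward: each facet pays off
    # with a fixed, made-up probability.
    means = {"natural": 0.7, "translationese": 0.3}
    return 1.0 if rng.random() < means[facet] else 0.0


def run(steps=2000, seed=0):
    sampler = Exp3FacetSampler(["natural", "translationese"], gamma=0.1, seed=seed)
    env_rng = random.Random(seed + 1)
    counts = {facet: 0 for facet in sampler.facets}
    for _ in range(steps):
        idx, prob = sampler.choose()
        facet = sampler.facets[idx]
        # Real training would draw a batch from `facet` and take a gradient
        # step here before measuring the reward.
        reward = simulated_reward(facet, env_rng)
        sampler.update(idx, prob, reward)
        counts[facet] += 1
    return sampler, counts


if __name__ == "__main__":
    sampler, counts = run()
    print(counts)
    print(sampler.probabilities())
```

Because the sampling distribution is updated after every step, the facet schedule emerges from the observed rewards rather than from a hand-designed mixing ratio, which is the manual effort the abstract aims to remove.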

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques

Methods: Test