X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality   Translation at Scale

Haoran Xu; Kenton Murray; Philipp Koehn; Hieu Hoang; Akiko Eriguchi,; Huda Khayrallah

arXiv:2410.03115·cs.CL·March 4, 2025

X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale

Haoran Xu, Kenton Murray, Philipp Koehn, Hieu Hoang, Akiko Eriguchi,, Huda Khayrallah

PDF

Open Access 10 Models 3 Datasets

TL;DR

X-ALMA is a multilingual translation model that achieves high-quality results across 50 languages by using plug-and-play modules and a novel adaptive rejection optimization, outperforming existing models on major benchmarks.

Contribution

The paper introduces X-ALMA, a new multilingual translation model with a plug-and-play architecture and ARPO optimization, ensuring balanced high-quality translation across resource levels.

Findings

01

X-ALMA outperforms state-of-the-art open-source multilingual LLMs on FLORES-200 and WMT'23 datasets.

02

The plug-and-play module architecture prevents language conflicts during training.

03

ARPO optimization surpasses existing preference methods in translation quality.

Abstract

Large language models (LLMs) have achieved remarkable success across various NLP tasks with a focus on English due to English-centric pre-training and limited multilingual data. In this work, we focus on the problem of translation, and while some multilingual LLMs claim to support for hundreds of languages, models often fail to provide high-quality responses for mid- and low-resource languages, leading to imbalanced performance heavily skewed in favor of high-resource languages. We introduce **X-ALMA**, a model designed to ensure top-tier performance across 50 diverse languages, regardless of their resource levels. X-ALMA surpasses state-of-the-art open-source multilingual LLMs, such as Aya-101 and Aya-23, in every single translation direction on the FLORES-200 and WMT'23 test datasets according to COMET-22. This is achieved by plug-and-play language-specific module architecture to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsFocus