NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Yingfeng Luo, Ziqiang Xu, Yuxuan Ouyang, Murun Yang, Dingyang Lin, Kaiyan Chang, Tong Zheng, Bei Li, Peinan Feng, Quan Du, Tong Xiao, Jingbo Zhu

TL;DR
This paper introduces NiuTrans.LMT, a scalable multilingual translation model that addresses directional degeneration using strategic downsampling and parallel prompting, achieving competitive results across 60 languages.
Contribution
The paper proposes novel methods—strategic downsampling and parallel multilingual prompting—to improve multilingual translation quality and scalability in large language models.
Findings
NiuTrans.LMT models perform on par or better than larger baselines.
Strategic Downsampling mitigates directional degeneration in multilingual training.
Parallel Multilingual Prompting enhances cross-lingual transfer and test-time performance.
Abstract
Large language models have significantly advanced Multilingual Machine Translation (MMT), yet scaling to many languages while keeping quality robust across directions remains challenging. In this paper, we identify a failure mode of multilingual supervised fine-tuning (SFT) on multi-way parallel data: when such data are reused symmetrically around a pivot language (e.g., English), performance on reverse directions (X pivot) can drop substantially. We term this phenomenon Directional Degeneration and attribute it to excessive many-to-one mappings, which encourage shortcut learning. We propose Strategic Downsampling (SD), a simple yet effective method to mitigate this degeneration. In addition, we introduce Parallel Multilingual Prompting (PMP), which augments translation instructions with an auxiliary parallel sentence to promote cross-lingual transfer during training and enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗NiuTrans/LMT-60-0.6B-Basemodel· 31 dl· ♡ 1031 dl♡ 10
- 🤗NiuTrans/LMT-60-4Bmodel· 1.4k dl· ♡ 91.4k dl♡ 9
- 🤗NiuTrans/LMT-60-0.6Bmodel· 791 dl· ♡ 9791 dl♡ 9
- 🤗NiuTrans/LMT-60-1.7B-Basemodel· 70 dl· ♡ 770 dl♡ 7
- 🤗NiuTrans/LMT-60-1.7Bmodel· 1.1k dl· ♡ 101.1k dl♡ 10
- 🤗NiuTrans/LMT-60-4B-Basemodel· 39 dl· ♡ 639 dl♡ 6
- 🤗NiuTrans/LMT-60-8B-Basemodel· 37 dl· ♡ 537 dl♡ 5
- 🤗NiuTrans/LMT-60-8Bmodel· 212 dl· ♡ 14212 dl♡ 14
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
