IKUN for WMT24 General MT Task: LLMs Are here for Multilingual Machine Translation
Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz

TL;DR
This paper presents IKUN and IKUN-C, two multilingual LLM-based systems for WMT24, demonstrating strong competitive performance across 11 language pairs and highlighting LLMs' potential in multilingual machine translation.
Contribution
Introduces IKUN and IKUN-C systems built on Llama-3 and Mistral-7B, showcasing effective multilingual translation with a two-stage training approach and competitive results.
Findings
IKUN-C achieved 6 first-place finishes among constrained systems.
IKUN secured 1 first-place and 2 second-place finishes overall.
Both systems show LLMs nearing effective multilingual translation capabilities.
Abstract
This paper introduces two multilingual systems, IKUN and IKUN-C, developed for the general machine translation task in WMT24. IKUN and IKUN-C represent an open system and a constrained system, respectively, built on Llama-3-8b and Mistral-7B-v0.3. Both systems are designed to handle all 11 language directions using a single model. According to automatic evaluation metrics, IKUN-C achieved 6 first-place and 3 second-place finishes among all constrained systems, while IKUN secured 1 first-place and 2 second-place finishes across both open and constrained systems. These encouraging results suggest that large language models (LLMs) are nearing the level of proficiency required for effective multilingual machine translation. The systems are based on a two-stage approach: first, continuous pre-training on monolingual data in 10 languages, followed by fine-tuning on high-quality parallel data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsOSCAR
