Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters
Shanbo Cheng, Yu Bao, Qian Cao, Luyang Huang, Liyan Kang, Zhicheng Liu, Yu Lu, Wenhao Zhu, Jingwen Chen, Zhichao Huang, Tao Li, Yifu Li, Huiying Lin, Sitong Liu, Ningxin Peng, Shuaijie She, Lu Xu, Nuo Xu, Sen Yang, Runsheng Yu, Yiming Yu, Liehao Zou, Hang Li, Lu Lu, Yuxuan Wang

TL;DR
Seed-X introduces a 7B parameter multilingual translation LLM that leverages diverse data, Chain-of-Thought reasoning, and reinforcement learning to achieve state-of-the-art performance across 28 languages, rivaling larger closed-source models.
Contribution
The paper presents Seed-X, a novel open-source multilingual translation LLM with innovative training and fine-tuning strategies for improved translation quality.
Findings
Seed-X matches performance of Gemini-2.5 and GPT-4o.
Outperforms larger open-source models in metrics and human evaluations.
Effective use of Chain-of-Thought reasoning and reinforcement learning.
Abstract
Multilingual translation stands as a challenging task for large language models (LLMs) to handle intricate language patterns and stilted translations that arise in automated translations. In this paper, we introduce Seed-X, a family of open-source LLMs comprising instruct and reasoning models, pushing the limits of translation capability with 7B parameter size. The base model is pre-trained on a diverse, high-quality dataset encompassing both monolingual and bilingual content across 28 languages, harnessing the full potential of multilingual data. The instruct model is then finetuned to translate by Chain-of-Thought (CoT) reasoning and further enhanced through reinforcement learning (RL) to achieve better generalization across diverse language pairs. Seed-X achieves performance comparable to leading closed-source models, including Gemini-2.5 and GPT-4o, across 28 languages, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ByteDance-Seed/Seed-X-PPO-7Bmodel· 11k dl· ♡ 30011k dl♡ 300
- 🤗ByteDance-Seed/Seed-X-Instruct-7Bmodel· 889 dl· ♡ 128889 dl♡ 128
- 🤗ByteDance-Seed/Seed-X-RM-7Bmodel· 76 dl· ♡ 3076 dl♡ 30
- 🤗ByteDance-Seed/Seed-X-PPO-7B-GPTQ-Int8model· 2.2k dl· ♡ 92.2k dl♡ 9
- 🤗ByteDance-Seed/Seed-X-PPO-7B-AWQ-Int4model· 93 dl· ♡ 993 dl♡ 9
- 🤗BaoLocTown/Seed-X-Instruct-7B-FP8-Dynamicmodel
- 🤗BaoLocTown/Seed-X-PPO-7B-FP8-Dynamicmodel· 13 dl13 dl
- 🤗laelhalawani/Seed-X-PPO-7B-GGUFmodel· 15 dl15 dl
- 🤗laelhalawani/Seed-X-PPO-7B-EN-PL-AWQ-GGUFmodel· 22 dl· ♡ 122 dl♡ 1
- 🤗thanhh12/Seed-X-PPO-7B-GPTQ-INT8model· 1 dl· ♡ 11 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
