Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild
Mao Zheng, Zheng Li, Tao Chen, Bo Lv, Mingrui Sun, Mingyang Song, Jinlong Song, Hong Huang, Decheng Wu, Hai Wang, Yifan Song, Yanfeng Chen, Guanwei Zhang, Guanghua Yu, Yi Su, Hong Liu, Jinxiang Ou, Keyao Wang, Weile Chen, Haozhao Kuang, Kai Wang, Nuo Chen, Zihao Zheng

TL;DR
Hy-MT2 is a family of multilingual translation models optimized for speed and efficiency, supporting 33 languages and excelling in real-world translation tasks across various domains.
Contribution
Introduces a new family of multilingual translation models with multiple sizes, optimized for deployment and outperforming existing open-source and commercial models.
Findings
The 1.8B model requires only 440 MB storage with 1.5x faster inference.
7B and 30B models outperform open-source models like DeepSeek-V4-Pro.
The lightweight 1.8B model surpasses mainstream commercial APIs in performance.
Abstract
Hy-MT2 is a family of fast-thinking multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages. For on-device deployment, with AngelSlim 1.25-bit extreme quantization, the 1.8B model requires only 440 MB of storage and improves inference speed by 1.5x. Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗tencent/Hy-MT2-30B-A3Bmodel· 3.1k dl· ♡ 4253.1k dl♡ 425
- 🤗tencent/Hy-MT2-1.8Bmodel· 16k dl· ♡ 108816k dl♡ 1088
- 🤗tencent/Hy-MT2-1.8B-GGUFmodel· 18k dl· ♡ 7318k dl♡ 73
- 🤗tencent/Hy-MT2-7Bmodel· 4.7k dl· ♡ 1774.7k dl♡ 177
- 🤗tencent/Hy-MT2-7B-GGUFmodel· 19k dl· ♡ 3619k dl♡ 36
- 🤗tencent/Hy-MT2-30B-A3B-FP8model· 8.3k dl· ♡ 148.3k dl♡ 14
- 🤗tencent/Hy-MT2-1.8B-1.25Bit-GGUFmodel· 4.4k dl· ♡ 294.4k dl♡ 29
- 🤗tencent/Hy-MT2-7B-FP8model· 1.3k dl· ♡ 91.3k dl♡ 9
- 🤗tencent/Hy-MT2-1.8B-FP8model· 943 dl· ♡ 9943 dl♡ 9
- 🤗unsloth/Hy-MT2-7B-GGUFmodel· ♡ 3♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
