HY-MT1.5 Technical Report

Mao Zheng; Zheng Li; Tao Chen; Mingyang Song; Di Wang

arXiv:2512.24092·cs.CL·January 1, 2026

HY-MT1.5 Technical Report

Mao Zheng, Zheng Li, Tao Chen, Mingyang Song, Di Wang

PDF

Open Access 8 Models

TL;DR

This paper introduces HY-MT1.5 translation models with a holistic training framework, achieving high performance and efficiency across multiple translation benchmarks and supporting advanced translation constraints.

Contribution

The paper presents a new family of translation models with a holistic training approach, outperforming larger open-source and commercial models at similar sizes.

Findings

01

HY-MT1.5-1.8B outperforms larger open-source and commercial models.

02

HY-MT1.5-7B achieves state-of-the-art results for its size class.

03

Models support advanced translation constraints like terminology and context.

Abstract

In this report, we introduce our latest translation models, HY-MT1.5-1.8B and HY-MT1.5-7B, a new family of machine translation models developed through a holistic training framework tailored for high-performance translation. Our methodology orchestrates a multi-stage pipeline that integrates general and MT-oriented pre-training, supervised fine-tuning, on-policy distillation, and reinforcement learning. HY-MT1.5-1.8B, the 1.8B-parameter model demonstrates remarkable parameter efficiency, comprehensively outperforming significantly larger open-source baselines (e.g., Tower-Plus-72B, Qwen3-32B) and mainstream commercial APIs (e.g., Microsoft Translator, Doubao Translator) in standard Chinese-foreign and English-foreign tasks. It achieves approximately 90% of the performance of ultra-large proprietary models such as Gemini-3.0-Pro, while marginally trailing Gemini-3.0-Pro on WMT25 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Big Data and Digital Economy · Topic Modeling