ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation

Kunquan Li; Yingxue Zhang; Fandong Meng; Jinsong Su

arXiv:2604.19144·cs.CL·April 22, 2026

ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation

Kunquan Li, Yingxue Zhang, Fandong Meng, Jinsong Su

PDF

TL;DR

ReflectMT introduces a two-stage internalization approach that enables high-quality machine translation in a single inference pass, significantly reducing costs while maintaining or improving translation quality.

Contribution

The paper presents a novel 'translate-reflect-refine' training paradigm that internalizes reasoning, allowing efficient high-quality translation without explicit multi-step reasoning during inference.

Findings

01

Outperforms multi-step reasoning models like DeepSeek-R1 in translation quality.

02

Reduces token consumption by over 94%.

03

Achieves a 2.16-point improvement in GPT-based evaluation metrics.

Abstract

Recent years have witnessed growing interest in applying Large Reasoning Models (LRMs) to Machine Translation (MT). Existing approaches predominantly adopt a "think-first-then-translate" paradigm. Although explicit reasoning trajectories significantly enhance translation quality, they incur prohibitive inference costs and latency. To address these limitations, we propose ReflectMT, a two-stage reflection internalization algorithm for machine translation that employs a "translate-first-think-later" paradigm. Our approach develops the model's "translate-reflect-refine" capability through reinforcement learning. In the first stage, we cultivate the model's capacity for high-quality reflection and refinement, thereby enhancing its semantic comprehension and task-specific knowledge. In the second stage, we train the model to internalize the knowledge acquired during reflection. As a result,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.