ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
Kunquan Li, Yingxue Zhang, Fandong Meng, Jinsong Su

TL;DR
ReflectMT introduces a two-stage internalization approach that enables high-quality machine translation in a single inference pass, significantly reducing costs while maintaining or improving translation quality.
Contribution
The paper presents a novel 'translate-reflect-refine' training paradigm that internalizes reasoning, allowing efficient high-quality translation without explicit multi-step reasoning during inference.
Findings
Outperforms multi-step reasoning models like DeepSeek-R1 in translation quality.
Reduces token consumption by over 94%.
Achieves a 2.16-point improvement in GPT-based evaluation metrics.
Abstract
Recent years have witnessed growing interest in applying Large Reasoning Models (LRMs) to Machine Translation (MT). Existing approaches predominantly adopt a "think-first-then-translate" paradigm. Although explicit reasoning trajectories significantly enhance translation quality, they incur prohibitive inference costs and latency. To address these limitations, we propose ReflectMT, a two-stage reflection internalization algorithm for machine translation that employs a "translate-first-think-later" paradigm. Our approach develops the model's "translate-reflect-refine" capability through reinforcement learning. In the first stage, we cultivate the model's capacity for high-quality reflection and refinement, thereby enhancing its semantic comprehension and task-specific knowledge. In the second stage, we train the model to internalize the knowledge acquired during reflection. As a result,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
