R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

Minggui He; Yilun Liu; Shimin Tao; Yuanchang Luo; Hongyong Zeng; Chang Su; Li Zhang; Hongxia Ma; Daimeng Wei; Weibin Meng; Hao Yang; Boxing Chen; Osamu Yoshie

arXiv:2502.19735 · cs.CL · May 27, 2025

TL;DR

This paper presents R1-T1, a reinforcement learning framework that enhances large language models' translation capabilities by incorporating human-aligned reasoning chains, improving performance across diverse languages and domains.

Contribution

The paper introduces a novel RL-based approach with expert-designed reasoning templates to improve general translation performance and adaptability in LLMs.

Findings

01. Improved translation quality across 10+ languages and 40+ directions.

02. Enhanced performance on unseen languages and domain-specific tasks.

03. Demonstrated effectiveness of reasoning-based translation in multiple scenarios.

Abstract

Despite recent breakthroughs in reasoning-enhanced large language models (LLMs) such as DeepSeek-R1, incorporating inference-time reasoning into machine translation (MT), where human translators naturally employ structured, multi-layered chains of thought (CoTs), remains underexplored. Existing methods either design a fixed CoT tailored to a specific MT sub-task (e.g., literature translation) or rely on synthesizing CoTs that are not human-aligned and on supervised fine-tuning (SFT), which is prone to overfitting; both limit adaptability to diverse translation scenarios. This paper introduces R1-Translator (R1-T1), a novel framework that achieves inference-time reasoning for general MT via reinforcement learning (RL) with human-aligned CoTs comprising six common patterns. Our approach pioneers three innovations: (1) extending reasoning-based translation to broader MT scenarios (e.g.,…

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques

Methods: Shrink and Fine-Tune