TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Zheng Li; Mao Zheng; Mingyang Song; Wenjie Yang

arXiv:2505.21172·cs.CL·May 28, 2025

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Zheng Li, Mao Zheng, Mingyang Song, Wenjie Yang

PDF

Open Access

TL;DR

This paper introduces TAT-R1, a reinforcement learning-based, terminology-aware translation model that improves translation accuracy for key terminology while maintaining overall translation quality.

Contribution

It presents a novel approach combining word alignment and reinforcement learning to enhance terminology translation in deep reasoning large language models.

Findings

01

Significant improvement in terminology translation accuracy.

02

Maintains comparable performance on general translation tasks.

03

Provides insights through ablation studies on training paradigms.

Abstract

Recently, deep reasoning large language models(LLMs) like DeepSeek-R1 have made significant progress in tasks such as mathematics and coding. Inspired by this, several studies have employed reinforcement learning(RL) to enhance models' deep reasoning capabilities and improve machine translation(MT) quality. However, the terminology translation, an essential task in MT, remains unexplored in deep reasoning LLMs. In this paper, we propose \textbf{TAT-R1}, a terminology-aware translation model trained with reinforcement learning and word alignment. Specifically, we first extract the keyword translation pairs using a word alignment model. Then we carefully design three types of rule-based alignment rewards with the extracted alignment relationships. With those alignment rewards, the RL-trained translation model can learn to focus on the accurate translation of key information, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification