Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Dayeon Ki; Marine Carpuat

arXiv:2404.07851·cs.CL·September 3, 2025·1 cites

Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Dayeon Ki, Marine Carpuat

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores guiding large language models to improve machine translation post-editing using error annotations, combining prompting and fine-tuning to enhance translation quality across multiple language pairs.

Contribution

It introduces a method to leverage MQM error annotations for guiding LLMs in post-editing MT, demonstrating the benefits of fine-tuning for better feedback utilization.

Findings

01

Prompting improves translation metrics like TER, BLEU, and COMET.

02

Fine-tuning enhances the model's ability to utilize detailed feedback.

03

Both automatic and human evaluations show improved translation quality.

Abstract

Machine Translation (MT) remains one of the last NLP tasks where large language models (LLMs) have not yet replaced dedicated supervised systems. This work exploits the complementary strengths of LLMs and supervised MT by guiding LLMs to automatically post-edit MT with external feedback on its quality, derived from Multidimensional Quality Metric (MQM) annotations. Working with LLaMA-2 models, we consider prompting strategies varying the nature of feedback provided and then fine-tune the LLM to improve its ability to exploit the provided guidance. Through experiments on Chinese-English, English-German, and English-Russian MQM data, we demonstrate that prompting LLMs to post-edit MT improves TER, BLEU and COMET scores, although the benefits of fine-grained feedback are not clear. Fine-tuning helps integrate fine-grained feedback more effectively and further improves translation quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dayeonki/mt_feedback
pytorchOfficial

Videos

Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification