Contextual Refinement of Translations: Large Language Models for   Sentence and Document-Level Post-Editing

Sai Koneru; Miriam Exel; Matthias Huck; Jan Niehues

arXiv:2310.14855·cs.CL·March 20, 2024·2 cites

Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing

Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues

PDF

Open Access 1 Video

TL;DR

This paper explores using large language models as automatic post-editors for translation, achieving state-of-the-art accuracy in pronoun resolution and demonstrating the benefits of manual feedback in document-level translation.

Contribution

It introduces a novel approach of adapting LLMs as post-editors with Low-Rank-Adapter fine-tuning, improving translation quality and generalization, especially at document level.

Findings

01

Achieved 89% accuracy on ContraPro pronoun resolution test.

02

Fine-tuning as post-editors outperforms direct translation fine-tuning.

03

Manual corrections significantly reduce subsequent editing effort.

Abstract

Large Language Models (LLM's) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their significant performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigate using LLM's for MT and explore recent parameter-efficient fine-tuning techniques. Surprisingly, our initial experiments find that fine-tuning for translation purposes even led to performance degradation. To overcome this, we propose an alternative approach: adapting LLM's as Automatic Post-Editors (APE) rather than direct translators. Building on the LLM's exceptional ability to process and generate lengthy sequences, we also propose extending our approach to document-level translation. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing· underline

Taxonomy

TopicsNatural Language Processing Techniques