Guiding In-Context Learning of LLMs through Quality Estimation for   Machine Translation

Javad Pourmostafa Roshan Sharami; Dimitar Shterionov; Pieter; Spronck

arXiv:2406.07970·cs.CL·September 19, 2024

Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation

Javad Pourmostafa Roshan Sharami, Dimitar Shterionov, Pieter, Spronck

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new in-context learning approach for machine translation that uses quality estimation to select the best examples, improving translation quality without needing references or fine-tuning.

Contribution

It proposes a domain-specific quality estimation guided search algorithm for selecting effective in-context examples in machine translation, outperforming existing methods.

Findings

01

Significant translation quality improvements over existing ICL methods.

02

Higher performance compared to fine-tuning models like mBART-50.

03

Effective in selecting impactful in-context examples without references.

Abstract

The quality of output from large language models (LLMs), particularly in machine translation (MT), is closely tied to the quality of in-context examples (ICEs) provided along with the query, i.e., the text to translate. The effectiveness of these ICEs is influenced by various factors, such as the domain of the source text, the order in which the ICEs are presented, the number of these examples, and the prompt templates used. Naturally, selecting the most impactful ICEs depends on understanding how these affect the resulting translation quality, which ultimately relies on translation references or human judgment. This paper presents a novel methodology for in-context learning (ICL) that relies on a search algorithm guided by domain-specific quality estimation (QE). Leveraging the XGLM model, our methodology estimates the resulting translation quality without the need for translation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JoyeBright/ICLviaQE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling