Paying More Attention to Source Context: Mitigating Unfaithful   Translations from Large Language Model

Hongbin Zhang; Kehai Chen; Xuefeng Bai; Yang Xiang; Min Zhang

arXiv:2406.07036·cs.CL·June 12, 2024

Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model

Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang

PDF

Open Access 1 Repo

TL;DR

This paper addresses the issue of unfaithful translations in large language models by proposing methods to enhance source context attention, resulting in more accurate and faithful multilingual translations.

Contribution

It introduces novel techniques to improve source context attention in decoder-only LLMs, reducing unfaithfulness in translations during zero-shot prompting.

Findings

01

Effective reduction in unfaithful translations across multiple language pairs.

02

Improved human evaluation scores for translation faithfulness.

03

Method outperforms baseline models in faithfulness metrics.

Abstract

Large language models (LLMs) have showcased impressive multilingual machine translation ability. However, unlike encoder-decoder style models, decoder-only LLMs lack an explicit alignment between source and target contexts. Analyzing contribution scores during generation processes revealed that LLMs can be biased towards previously generated tokens over corresponding source tokens, leading to unfaithful translations. To address this issue, we propose to encourage LLMs to pay more attention to the source context from both source and target perspectives in zeroshot prompting: 1) adjust source context attention weights; 2) suppress irrelevant target prefix influence; Additionally, we propose 3) avoiding over-reliance on the target prefix in instruction tuning. Experimental results from both human-collected unfaithfulness test sets focusing on LLM-generated unfaithful translations and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AzureStarz/paying_attention_to_the_source
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling