Word Alignment as Preference for Machine Translation
Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka

TL;DR
This paper introduces a novel method to improve machine translation by guiding large language models with word alignment preferences, effectively reducing hallucination and omission issues, and providing new evaluation strategies.
Contribution
It proposes using word alignment as a preference signal for optimizing LLM-based MT models and introduces GPT-4-based evaluation for hallucination and omission mitigation.
Findings
Word alignment preference optimization reduces hallucination and omission.
The method shows slight BLEU improvements and decreases in COMET scores.
GPT-4-based evaluation effectively assesses hallucination and omission mitigation.
Abstract
The problem of hallucination and omission, a long-standing problem in machine translation (MT), is more pronounced when a large language model (LLM) is used in MT because an LLM itself is susceptible to these phenomena. In this work, we mitigate the problem in an LLM-based MT model by guiding it to better word alignment. We first study the correlation between word alignment and the phenomena of hallucination and omission in MT. Then we propose to utilize word alignment as preference to optimize the LLM-based MT model. The preference data are constructed by selecting chosen and rejected translations from multiple MT tools. Subsequently, direct preference optimization is used to optimize the LLM-based model towards the preference signal. Given the absence of evaluators specifically designed for hallucination and omission in MT, we further propose selecting hard instances and utilizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
