Language Models and Cycle Consistency for Self-Reflective Machine   Translation

Jianqiao Wangni

arXiv:2411.02791·cs.CL·November 6, 2024

Language Models and Cycle Consistency for Self-Reflective Machine Translation

Jianqiao Wangni

PDF

Open Access

TL;DR

This paper proposes a cycle consistency framework using large language models to evaluate and improve machine translation quality without ground-truth references, leveraging back-translation and token-level metrics.

Contribution

It introduces a novel cycle consistency method for unsupervised translation quality estimation and LLM capability assessment based solely on monolingual data.

Findings

01

Larger LLMs show higher cycle consistency scores.

02

More inference passes improve translation quality.

03

Cycle consistency correlates with model size and computation.

Abstract

This paper introduces a novel framework that leverages large language models (LLMs) for machine translation (MT). We start with one conjecture: an ideal translation should contain complete and accurate information for a strong enough LLM to recover the original sentence. We generate multiple translation candidates from a source language A to a target language B, and subsequently translate these candidates back to the original language A. By evaluating the cycle consistency between the original and back-translated sentences using metrics such as token-level precision and accuracy, we implicitly estimate the translation quality in language B, without knowing its ground-truth. This also helps to evaluate the LLM translation capability, only with monolingual corpora. For each source sentence, we identify the translation candidate with optimal cycle consistency with the original sentence as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques