Language Models and Cycle Consistency for Self-Reflective Machine Translation
Jianqiao Wangni

TL;DR
This paper proposes a cycle consistency framework using large language models to evaluate and improve machine translation quality without ground-truth references, leveraging back-translation and token-level metrics.
Contribution
It introduces a novel cycle consistency method for unsupervised translation quality estimation and LLM capability assessment based solely on monolingual data.
Findings
Larger LLMs show higher cycle consistency scores.
More inference passes improve translation quality.
Cycle consistency correlates with model size and computation.
Abstract
This paper introduces a novel framework that leverages large language models (LLMs) for machine translation (MT). We start with one conjecture: an ideal translation should contain complete and accurate information for a strong enough LLM to recover the original sentence. We generate multiple translation candidates from a source language A to a target language B, and subsequently translate these candidates back to the original language A. By evaluating the cycle consistency between the original and back-translated sentences using metrics such as token-level precision and accuracy, we implicitly estimate the translation quality in language B, without knowing its ground-truth. This also helps to evaluate the LLM translation capability, only with monolingual corpora. For each source sentence, we identify the translation candidate with optimal cycle consistency with the original sentence as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
