Anticipating Future with Large Language Model for Simultaneous Machine Translation

Siqi Ouyang; Oleksii Hrinchuk; Zhehuai Chen; Vitaly Lavrukhin; Jagadeesh Balam; Lei Li; Boris Ginsburg

arXiv:2410.22499·cs.CL·June 3, 2025

Anticipating Future with Large Language Model for Simultaneous Machine Translation

Siqi Ouyang, Oleksii Hrinchuk, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Lei Li, Boris Ginsburg

PDF

Open Access 1 Video

TL;DR

This paper introduces TAF, a novel approach using large language models to predict future source words in simultaneous machine translation, significantly improving translation quality at low latency.

Contribution

The paper proposes TAF, a new method leveraging LLMs for future word anticipation in SMT, enhancing quality-latency trade-offs over existing methods.

Findings

01

TAF outperforms baselines by up to 5 BLEU points at the same latency.

02

It achieves the best quality-latency trade-off among evaluated methods.

03

Code is publicly available for reproducibility.

Abstract

Simultaneous machine translation (SMT) takes streaming input utterances and incrementally produces target text. Existing SMT methods mainly use the partial utterance that has already arrived at the input and the generated hypothesis. Motivated by human interpreters' technique to forecast future words before hearing them, we propose $T$ ranslation by $A$ nticipating $F$ uture (TAF), a method to improve translation quality while retraining low latency. Its core idea is to use a large language model (LLM) to predict future source words and opportunistically translate without introducing too much risk. We evaluate our TAF and multiple baselines of SMT on four language directions. Experiments show that TAF achieves the best translation quality-latency trade-off and outperforms the baselines by up to 5 BLEU points at the same latency (three words). Code is released at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Anticipating Future with Large Language Model for Simultaneous Machine Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques