PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for   Simultaneous Machine Translation

Libo Zhao; Jing Li; Ziqian Zeng

arXiv:2410.04075·cs.CL·October 8, 2024

PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation

Libo Zhao, Jing Li, Ziqian Zeng

PDF

Open Access 1 Video

TL;DR

PsFuture introduces a zero-shot adaptive policy for simultaneous translation that eliminates the need for additional training, leveraging a novel training strategy to improve quality-latency trade-offs.

Contribution

The paper presents PsFuture, the first zero-shot adaptive read/write policy for SiMT, and a new Prefix-to-Full training method for offline models adapted to real-time translation.

Findings

01

Zero-shot policy performs on par with strong baselines.

02

P2F training enhances translation quality and latency trade-off.

03

Method reduces training complexity for SiMT systems.

Abstract

Simultaneous Machine Translation (SiMT) requires target tokens to be generated in real-time as streaming source tokens are consumed. Traditional approaches to SiMT typically require sophisticated architectures and extensive parameter configurations for training adaptive read/write policies, which in turn demand considerable computational power and memory. We propose PsFuture, the first zero-shot adaptive read/write policy for SiMT, enabling the translation model to independently determine read/write actions without the necessity for additional training. Furthermore, we introduce a novel training strategy, Prefix-to-Full (P2F), specifically tailored to adjust offline translation models for SiMT applications, exploiting the advantages of the bidirectional attention mechanism inherent in offline models. Experiments across multiple benchmarks demonstrate that our zero-shot policy attains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsSoftmax · Attention Is All You Need