Glancing Future for Simultaneous Machine Translation

Shoutao Guo; Shaolei Zhang; Yang Feng

arXiv:2309.06179·cs.CL·May 29, 2024

Glancing Future for Simultaneous Machine Translation

Shoutao Guo, Shaolei Zhang, Yang Feng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a curriculum learning approach that gradually shifts from full-sequence to prefix-based training in simultaneous machine translation, improving model performance by bridging the training gap.

Contribution

We propose a novel 'glancing future' curriculum learning method that transitions SiMT models from seq2seq to prefix2prefix training, enhancing translation quality.

Findings

01

Outperforms strong baseline methods

02

Applicable to various SiMT approaches

03

Improves translation accuracy and robustness

Abstract

Simultaneous machine translation (SiMT) outputs translation while reading the source sentence. Unlike conventional sequence-to-sequence (seq2seq) training, existing SiMT methods adopt the prefix-to-prefix (prefix2prefix) training, where the model predicts target tokens based on partial source tokens. However, the prefix2prefix training diminishes the ability of the model to capture global information and introduces forced predictions due to the absence of essential source information. Consequently, it is crucial to bridge the gap between the prefix2prefix training and seq2seq training to enhance the translation capability of the SiMT model. In this paper, we propose a novel method that glances future in curriculum learning to achieve the transition from the seq2seq training to prefix2prefix training. Specifically, we gradually reduce the available source information from the whole…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ictnlp/glance-simt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence