STACL: Simultaneous Translation with Implicit Anticipation and   Controllable Latency using Prefix-to-Prefix Framework

Mingbo Ma; Liang Huang; Hao Xiong; Renjie Zheng; Kaibo Liu; Baigong; Zheng; Chuanqiang Zhang; Zhongjun He; Hairong Liu; Xing Li; Hua Wu and; Haifeng Wang

arXiv:1810.08398·cs.CL·June 26, 2019·5 cites

STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework

Mingbo Ma, Liang Huang, Hao Xiong, Renjie Zheng, Kaibo Liu, Baigong, Zheng, Chuanqiang Zhang, Zhongjun He, Hairong Liu, Xing Li, Hua Wu and, Haifeng Wang

PDF

Open Access 3 Repos

TL;DR

This paper introduces a prefix-to-prefix framework for simultaneous translation that enables models to anticipate words and control latency, achieving low delay and good quality across multiple language pairs.

Contribution

It proposes a novel prefix-to-prefix translation framework with an effective wait-k policy for low-latency, high-quality simultaneous translation.

Findings

01

Achieves low latency and reasonable translation quality

02

Effective anticipation in a single model

03

Works well across multiple language pairs

Abstract

Simultaneous translation, which translates sentences before they are finished, is useful in many scenarios but is notoriously difficult due to word-order differences. While the conventional seq-to-seq framework is only suitable for full-sentence translation, we propose a novel prefix-to-prefix framework for simultaneous translation that implicitly learns to anticipate in a single translation model. Within this framework, we present a very simple yet surprisingly effective wait-k policy trained to generate the target sentence concurrently with the source sentence, but always k words behind. Experiments show our strategy achieves low latency and reasonable quality (compared to full-sentence translation) on 4 directions: zh<->en and de<->en.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications