Speculative Beam Search for Simultaneous Translation

Renjie Zheng; Mingbo Ma; Baigong Zheng; Liang Huang

arXiv:1909.05421·cs.CL·September 13, 2019

Speculative Beam Search for Simultaneous Translation

Renjie Zheng, Mingbo Ma, Baigong Zheng, Liang Huang

PDF

Open Access

TL;DR

This paper introduces a speculative beam search algorithm that enables beam search to be used effectively in simultaneous translation by hallucinating future steps, leading to significant improvements across multiple language pairs.

Contribution

It proposes a novel speculative beam search method that allows beam search to generate one word at a time in simultaneous translation, overcoming previous limitations.

Findings

01

Large improvements over previous methods in diverse language pairs

02

Enables beam search to be applied to single-word generation in real-time translation

03

Implicitly benefits from a target language model

Abstract

Beam search is universally used in full-sentence translation but its application to simultaneous translation remains non-trivial, where output words are committed on the fly. In particular, the recently proposed wait-k policy (Ma et al., 2019a) is a simple and effective method that (after an initial wait) commits one output word on receiving each input word, making beam search seemingly impossible. To address this challenge, we propose a speculative beam search algorithm that hallucinates several steps into the future in order to reach a more accurate decision, implicitly benefiting from a target language model. This makes beam search applicable for the first time to the generation of a single word in each step. Experiments over diverse language pairs show large improvements over previous work.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications