When to Finish? Optimal Beam Search for Neural Text Generation (modulo   beam size)

Liang Huang; Kai Zhao; Mingbo Ma

arXiv:1809.00069·cs.CL·September 5, 2018·5 cites

When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size)

Liang Huang, Kai Zhao, Mingbo Ma

PDF

Open Access

TL;DR

This paper introduces a provably optimal beam search algorithm for neural text generation that guarantees the best hypothesis is found efficiently, improving translation quality.

Contribution

It presents a new beam search method with proven optimality and a length reward mechanism to handle shorter hypotheses, advancing neural text generation techniques.

Findings

01

Improved BLEU scores in neural machine translation

02

Algorithm guarantees optimal hypothesis completion

03

Handles length bias with bounded length reward

Abstract

In neural text generation such as neural machine translation, summarization, and image captioning, beam search is widely used to improve the output text quality. However, in the neural generation setting, hypotheses can finish in different steps, which makes it difficult to decide when to end beam search to ensure optimality. We propose a provably optimal beam search algorithm that will always return the optimal-score complete hypothesis (modulo beam size), and finish as soon as the optimality is established (finishing no later than the baseline). To counter neural generation's tendency for shorter hypotheses, we also introduce a bounded length reward mechanism which allows a modified version of our beam search algorithm to remain optimal. Experiments on neural machine translation demonstrate that our principled beam search algorithm leads to improvement in BLEU score over previously…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications