Best-First Beam Search

Clara Meister; Tim Vieira; Ryan Cotterell

arXiv:2007.03909·cs.CL·November 16, 2022

Best-First Beam Search

Clara Meister, Tim Vieira, Ryan Cotterell

PDF

1 Repo

TL;DR

This paper introduces a faster, memory-efficient variant of beam search for NLP decoding that leverages monotonic scoring assumptions to prune hypotheses early, improving speed without sacrificing performance.

Contribution

It proposes a novel monotonic approximation approach to accelerate beam search and a memory-reduced variant that maintains search bias benefits.

Findings

01

Up to 10x faster decoding in practice

02

Effective pruning based on monotonic scoring functions

03

Memory reduction with maintained search bias

Abstract

Decoding for many NLP tasks requires an effective heuristic algorithm for approximating exact search since the problem of searching the full output space is often intractable, or impractical in many settings. The default algorithm for this job is beam search -- a pruned version of breadth-first search. Quite surprisingly, beam search often returns better results than exact inference due to beneficial search bias for NLP tasks. In this work, we show that the standard implementation of beam search can be made up to 10x faster in practice. Our method assumes that the scoring function is monotonic in the sequence length, which allows us to safely prune hypotheses that cannot be in the final set of hypotheses early on. We devise effective monotonic approximations to popular nonmonontic scoring functions, including length normalization and mutual information decoding. Lastly, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rycolab/bfbs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.