Massive-scale Decoding for Text Generation using Lattices

Jiacheng Xu; Siddhartha Reddy Jonnalagadda; Greg Durrett

arXiv:2112.07660·cs.CL·May 4, 2022

Massive-scale Decoding for Text Generation using Lattices

Jiacheng Xu, Siddhartha Reddy Jonnalagadda, Greg Durrett

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel lattice-based search algorithm for neural text generation that efficiently encodes thousands of diverse, high-quality options, surpassing traditional beam search in diversity and efficiency.

Contribution

It presents a best-first search algorithm with hypothesis recombination to generate massive, diverse text options in a single lattice structure, improving over existing methods.

Findings

01

Encodes thousands of diverse options in a single lattice

02

Improves efficiency over beam search

03

Maintains grammaticality and quality of generated options

Abstract

Conditional neural text generation models generate high-quality outputs, but often concentrate around a mode when what we really want is a diverse set of options. We present a search algorithm to construct lattices encoding a massive number of generation options. First, we restructure decoding as a best-first search, which explores the space differently than beam search and improves efficiency by avoiding pruning paths. Second, we revisit the idea of hypothesis recombination: we can identify pairs of similar generation candidates during search and merge them as an approximation. On both summarization and machine translation, we show that our algorithm encodes thousands of diverse options that remain grammatical and high-quality into one lattice. This algorithm provides a foundation for building downstream generation applications on top of massive-scale diverse outputs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiacheng-xu/lattice-generation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsPruning