A Simple, Fast Diverse Decoding Algorithm for Neural Generation

Jiwei Li; Will Monroe; Dan Jurafsky

arXiv:1611.08562·cs.CL·December 23, 2016·227 cites

A Simple, Fast Diverse Decoding Algorithm for Neural Generation

Jiwei Li, Will Monroe, Dan Jurafsky

PDF

Open Access 1 Repo

TL;DR

This paper introduces a simple, fast decoding algorithm that enhances diversity in neural generation tasks like dialogue, summarization, and translation, with an adaptive RL-based variation further improving performance.

Contribution

It presents a novel diversity-promoting decoding algorithm with an RL-based adaptive variant, improving neural generation quality across multiple tasks.

Findings

01

Diverse decoding improves task performance, especially with reranking.

02

The RL-based adaptive method further boosts results.

03

The approach is effective in dialogue, summarization, and translation.

Abstract

In this paper, we propose a simple, fast decoding algorithm that fosters diversity in neural generation. The algorithm modifies the standard beam search algorithm by adding an inter-sibling ranking penalty, favoring choosing hypotheses from diverse parents. We evaluate the proposed model on the tasks of dialogue response generation, abstractive summarization and machine translation. We find that diverse decoding helps across all tasks, especially those for which reranking is needed. We further propose a variation that is capable of automatically adjusting its diversity decoding rates for different inputs using reinforcement learning (RL). We observe a further performance boost from this RL technique. This paper includes material from the unpublished script "Mutual Information and Diverse Decoding Improve Neural Machine Translation" (Li and Jurafsky, 2016).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TaiseiAso/BiGruAttEncDec
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems