TL;DR
This paper introduces incremental beam manipulation, a decoding technique that dynamically reranks hypotheses during neural language generation, improving output quality without retraining models.
Contribution
It proposes a novel incremental reranking method during beam search, enhancing natural language generation performance over standard beam search and reranking approaches.
Findings
Improves BLEU scores by 1.93 and 5.82 points on E2E and WebNLG datasets.
Outperforms a strong reranker on E2E dataset.
Achieves comparable results to reranking on WebNLG.
Abstract
The performance of natural language generation systems has improved substantially with modern neural networks. At test time they typically employ beam search to avoid locally optimal but globally suboptimal predictions. However, due to model errors, a larger beam size can lead to deteriorating performance according to the evaluation metric. For this reason, it is common to rerank the output of beam search, but this relies on beam search to produce a good set of hypotheses, which limits the potential gains. Other alternatives to beam search require changes to the training of the model, which restricts their applicability compared to beam search. This paper proposes incremental beam manipulation, i.e. reranking the hypotheses in the beam during decoding instead of only at the end. This way, hypotheses that are unlikely to lead to a good final output are discarded, and in their place…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
