Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K Vijayakumar, Michael Cogswell, Ramprasath R. Selvaraju, Qing, Sun, Stefan Lee, David Crandall, Dhruv Batra

TL;DR
Diverse Beam Search (DBS) enhances neural sequence decoding by generating more varied outputs, improving solution quality across tasks like image captioning and translation with minimal additional computational cost.
Contribution
This paper introduces Diverse Beam Search, a novel decoding algorithm that produces diverse output sequences from neural models, outperforming standard beam search and previous methods.
Findings
DBS finds better top-1 solutions than beam search.
DBS maintains computational efficiency comparable to beam search.
DBS improves diversity and quality across multiple tasks.
Abstract
Neural sequence models are widely used to model time-series data. Equally ubiquitous is the usage of beam search (BS) as an approximate inference algorithm to decode output sequences from these models. BS explores the search space in a greedy left-right fashion retaining only the top-B candidates - resulting in sequences that differ only slightly from each other. Producing lists of nearly identical sequences is not only computationally wasteful but also typically fails to capture the inherent ambiguity of complex AI tasks. To overcome this problem, we propose Diverse Beam Search (DBS), an alternative to BS that decodes a list of diverse outputs by optimizing for a diversity-augmented objective. We observe that our method finds better top-1 solutions by controlling for the exploration and exploitation of the search space - implying that DBS is a better search algorithm. Moreover, these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗alexrink/Pegasus_VTSSum_14epochsmodel· 2 dl2 dl
- 🤗alexrink/Pegasus_VTSSum_9epochsmodel· 1 dl1 dl
- 🤗alexrink/Pegasus_VTSSum_5epochsmodel· 2 dl2 dl
- 🤗alexrink/Pegasus_VTSSum_3epochsmodel· 1 dl1 dl
- 🤗alexrink/Pegasus_VTSSum_4epochsmodel· 1 dl1 dl
- 🤗alexrink/Pegasus_CNN_VTSSum_5epochsmodel· 1 dl1 dl
- 🤗transformers-community/group-beam-searchmodel· 63 dl63 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)
