Global-aware Beam Search for Neural Abstractive Summarization

Ye Ma; Zixun Lan; Lu Zong; Kaizhu Huang

arXiv:2009.06891·cs.CL·October 27, 2021·1 cites

Global-aware Beam Search for Neural Abstractive Summarization

Ye Ma, Zixun Lan, Lu Zong, Kaizhu Huang

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a global-aware beam search algorithm for neural abstractive summarization that leverages attention distribution predictions to improve summary quality and robustness across multiple datasets.

Contribution

It proposes a novel global attention-based protocol and scoring mechanism that enhance beam search to achieve near-global optimal summaries in neural summarization.

Findings

01

Significant improvements over state-of-the-art models on nine datasets.

02

Robustness to corrupted attention distributions.

03

Effective in generating meaningful summaries with empirical hyper-parameters.

Abstract

This study develops a calibrated beam-based algorithm with awareness of the global attention distribution for neural abstractive summarization, aiming to improve the local optimality problem of the original beam search in a rigorous way. Specifically, a novel global protocol is proposed based on the attention distribution to stipulate how a global optimal hypothesis should attend to the source. A global scoring mechanism is then developed to regulate beam search to generate summaries in a near-global optimal fashion. This novel design enjoys a distinctive property, i.e., the global attention distribution could be predicted before inference, enabling step-wise improvements on the beam search through the global scoring mechanism. Extensive experiments on nine datasets show that the global (attention)-aware inference significantly improves state-of-the-art summarization models even using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Global-aware Beam Search for Neural Abstractive Summarization· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Dropout · Dense Connections · Byte Pair Encoding · Label Smoothing · Multi-Head Attention · Attention Is All You Need