Focus Attention: Promoting Faithfulness and Diversity in Summarization

Rahul Aralikatte; Shashi Narayan; Joshua Maynez; Sascha Rothe; Ryan; McDonald

arXiv:2105.11921·cs.CL·May 26, 2021

Focus Attention: Promoting Faithfulness and Diversity in Summarization

Rahul Aralikatte, Shashi Narayan, Joshua Maynez, Sascha Rothe, Ryan, McDonald

PDF

TL;DR

This paper introduces Focus Attention and Focus Sampling mechanisms to improve faithfulness and diversity in summarization, achieving better results on the BBC extreme summarization task.

Contribution

It proposes novel attention and sampling methods that enhance faithfulness and diversity in summarization models, addressing gaps in current seq2seq approaches.

Findings

01

Focus Attention improves summary faithfulness and relevance.

02

Focus Sampling yields more diverse and faithful summaries.

03

Models with Focus mechanisms outperform vanilla models on ROUGE and faithfulness metrics.

Abstract

Professional summaries are written with document-level information, such as the theme of the document, in mind. This is in contrast with most seq2seq decoders which simultaneously learn to focus on salient content, while deciding what to generate, at each decoding step. With the motivation to narrow this gap, we introduce Focus Attention Mechanism, a simple yet effective method to encourage decoders to proactively generate tokens that are similar or topical to the input document. Further, we propose a Focus Sampling method to enable generation of diverse summaries, an area currently understudied in summarization. When evaluated on the BBC extreme summarization task, two state-of-the-art models augmented with Focus Attention generate summaries that are closer to the target and more faithful to their input documents, outperforming their vanilla counterparts on \rouge and multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence