In Conclusion Not Repetition: Comprehensive Abstractive Summarization   With Diversified Attention Based On Determinantal Point Processes

Lei Li; Wei Liu; Marina Litvak; Natalia Vanetik; Zuying Huang

arXiv:1909.10852·cs.CL·January 3, 2020

In Conclusion Not Repetition: Comprehensive Abstractive Summarization With Diversified Attention Based On Determinantal Point Processes

Lei Li, Wei Liu, Marina Litvak, Natalia Vanetik, Zuying Huang

PDF

1 Repo

TL;DR

This paper proposes a diversified attention mechanism using Determinantal Point Processes in Seq2Seq models to enhance the abstraction and comprehensiveness of machine-generated summaries.

Contribution

It introduces the DivCNN Seq2Seq model with DPP-based attention to improve summary diversity and abstraction without altering the end-to-end architecture.

Findings

01

Achieves higher ROUGE scores than baseline models.

02

Produces more comprehensive and diverse summaries.

03

Maintains end-to-end training compatibility.

Abstract

Various Seq2Seq learning models designed for machine translation were applied for abstractive summarization task recently. Despite these models provide high ROUGE scores, they are limited to generate comprehensive summaries with a high level of abstraction due to its degenerated attention distribution. We introduce Diverse Convolutional Seq2Seq Model(DivCNN Seq2Seq) using Determinantal Point Processes methods(Micro DPPs and Macro DPPs) to produce attention distribution considering both quality and diversity. Without breaking the end to end architecture, DivCNN Seq2Seq achieves a higher level of comprehensiveness compared to vanilla models and strong baselines. All the reproducible codes and datasets are available online.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thinkwee/DPP_CNN_Summarization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence