A Discourse-Aware Attention Model for Abstractive Summarization of Long   Documents

Arman Cohan; Franck Dernoncourt; Doo Soon Kim; Trung Bui; Seokhwan; Kim; Walter Chang; Nazli Goharian

arXiv:1804.05685·cs.CL·May 23, 2018

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan, Kim, Walter Chang, Nazli Goharian

PDF

3 Repos 3 Models 5 Datasets

TL;DR

This paper introduces a novel hierarchical, discourse-aware neural model designed for abstractive summarization of long documents like research papers, demonstrating significant improvements over existing models on scientific datasets.

Contribution

It presents the first discourse-structured hierarchical encoder and attentive decoder tailored for long document summarization, advancing the capabilities of neural summarization models.

Findings

01

Outperforms state-of-the-art models on scientific paper datasets

02

Effectively models discourse structure for better summaries

03

Significant improvement in long document summarization quality

Abstract

Neural abstractive summarization models have led to promising results in summarizing relatively short documents. We propose the first model for abstractive summarization of single, longer-form documents (e.g., research papers). Our approach consists of a new hierarchical encoder that models the discourse structure of a document, and an attentive discourse-aware decoder to generate the summary. Empirical results on two large-scale datasets of scientific papers show that our model significantly outperforms state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.