Fast Abstractive Summarization with Reinforce-Selected Sentence   Rewriting

Yen-Chun Chen; Mohit Bansal

arXiv:1805.11080·cs.CL·May 29, 2018·94 cites

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

Yen-Chun Chen, Mohit Bansal

PDF

Open Access 3 Repos

TL;DR

This paper introduces a fast, hierarchical abstractive summarization model that selects and rewrites salient sentences, achieving state-of-the-art results with significantly improved speed and abstractiveness.

Contribution

The paper presents a novel sentence-level policy gradient method for hierarchical sentence selection and rewriting, enabling faster inference and higher-quality summaries.

Findings

01

Achieves state-of-the-art performance on CNN/Daily Mail dataset.

02

Enables 10-20x faster inference speed.

03

Demonstrates better generalization on DUC-2002 dataset.

Abstract

Inspired by how humans summarize long documents, we propose an accurate and fast summarization model that first selects salient sentences and then rewrites them abstractively (i.e., compresses and paraphrases) to generate a concise overall summary. We use a novel sentence-level policy gradient method to bridge the non-differentiable computation between these two neural networks in a hierarchical way, while maintaining language fluency. Empirically, we achieve the new state-of-the-art on all metrics (including human evaluation) on the CNN/Daily Mail dataset, as well as significantly higher abstractiveness scores. Moreover, by first operating at the sentence-level and then the word-level, we enable parallel decoding of our neural generative model that results in substantially faster (10-20x) inference speed as well as 4x faster training convergence than previous long-paragraph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings