Neural Summarization by Extracting Sentences and Words

Jianpeng Cheng; Mirella Lapata

arXiv:1603.07252·cs.CL·July 4, 2016·128 cites

Neural Summarization by Extracting Sentences and Words

Jianpeng Cheng, Mirella Lapata

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network-based extractive summarization method that uses a hierarchical encoder and attention mechanism, achieving state-of-the-art results without linguistic features.

Contribution

It presents a flexible, data-driven framework for extractive summarization that can extract sentences or words using neural networks, bypassing traditional feature engineering.

Findings

01

Achieves results comparable to state-of-the-art methods

02

Operates without linguistic annotation

03

Works on large-scale datasets

Abstract

Traditional approaches to extractive summarization rely heavily on human-engineered features. In this work we propose a data-driven approach based on neural networks and continuous sentence features. We develop a general framework for single-document summarization composed of a hierarchical document encoder and an attention-based extractor. This architecture allows us to develop different classes of summarization models which can extract sentences or words. We train our models on large scale corpora containing hundreds of thousands of document-summary pairs. Experimental results on two summarization datasets demonstrate that our models obtain results comparable to the state of the art without any access to linguistic annotation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adrian9631/TextSumma
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques