Learning-Based Single-Document Summarization with Compression and   Anaphoricity Constraints

Greg Durrett; Taylor Berg-Kirkpatrick; and Dan Klein

arXiv:1603.08887·cs.CL·June 9, 2016·32 cites

Learning-Based Single-Document Summarization with Compression and Anaphoricity Constraints

Greg Durrett, Taylor Berg-Kirkpatrick, and Dan Klein

PDF

Open Access

TL;DR

This paper introduces a discriminative model for single-document summarization that combines compression and anaphoricity constraints, improving summary quality by ensuring coherence and relevance through learned features and dependency rules.

Contribution

The work presents a novel integrated framework that jointly models compression and anaphoricity constraints, trained end-to-end for enhanced summarization performance.

Findings

01

Outperforms prior models on ROUGE scores

02

Achieves higher human judgment scores of linguistic quality

03

Effectively maintains coherence through anaphoricity constraints

Abstract

We present a discriminative model for single-document summarization that integrally combines compression and anaphoricity constraints. Our model selects textual units to include in the summary based on a rich set of sparse features whose weights are learned on a large corpus. We allow for the deletion of content within a sentence when that deletion is licensed by compression rules; in our framework, these are implemented as dependencies between subsentential units of text. Anaphoricity constraints then improve cross-sentence coherence by guaranteeing that, for each pronoun included in the summary, the pronoun's antecedent is included as well or the pronoun is rewritten as a full mention. When trained end-to-end, our final system outperforms prior work on both ROUGE as well as on human judgments of linguistic quality.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies