Compressive Summarization with Plausibility and Salience Modeling

Shrey Desai; Jiacheng Xu; Greg Durrett

arXiv:2010.07886·cs.CL·October 16, 2020

Compressive Summarization with Plausibility and Salience Modeling

Shrey Desai, Jiacheng Xu, Greg Durrett

PDF

1 Repo

TL;DR

This paper introduces a data-driven compressive summarization method that uses plausibility and salience models to improve grammaticality and factuality, achieving strong results and cross-domain generalization.

Contribution

It relaxes syntactic constraints by employing plausibility and salience criteria judged by Transformer models for better compression decisions.

Findings

01

Achieves strong in-domain summarization results.

02

Human evaluation confirms grammatical and factual plausibility.

03

Cross-domain adaptation with minimal data is effective.

Abstract

Compressive summarization systems typically rely on a crafted set of syntactic rules to determine what spans of possible summary sentences can be deleted, then learn a model of what to actually delete by optimizing for content selection (ROUGE). In this work, we propose to relax the rigid syntactic constraints on candidate spans and instead leave compression decisions to two data-driven criteria: plausibility and salience. Deleting a span is plausible if removing it maintains the grammaticality and factuality of a sentence, and spans are salient if they contain important information from the summary. Each of these is judged by a pre-trained Transformer model, and only deletions that are both plausible and not salient can be applied. When integrated into a simple extraction-compression pipeline, our method achieves strong in-domain results on benchmark summarization datasets, and human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shreydesai/cups
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Layer Normalization · Dense Connections · Multi-Head Attention · Label Smoothing