To Point or Not to Point: Understanding How Abstractive Summarizers   Paraphrase Text

Matt Wilber; William Timkey; Marten Van Schijndel

arXiv:2106.01581·cs.CL·June 4, 2021

To Point or Not to Point: Understanding How Abstractive Summarizers Paraphrase Text

Matt Wilber, William Timkey, Marten Van Schijndel

PDF

TL;DR

This paper investigates how a popular neural summarization model balances copying and generating text, revealing its limited ability to produce truly abstractive and faithful summaries despite high ROUGE scores.

Contribution

The study provides a detailed analysis of the pointer-generator model's strategies and limitations in abstractive summarization, highlighting its reliance on syntactic cues and limited paraphrasing skills.

Findings

01

Model uses syntactic boundaries to truncate sentences on extractive-biased datasets.

02

Forcing generation reveals limited paraphrasing and factual inaccuracies.

03

Model shows limited abstractive abilities on abstractive-biased datasets.

Abstract

Abstractive neural summarization models have seen great improvements in recent years, as shown by ROUGE scores of the generated summaries. But despite these improved metrics, there is limited understanding of the strategies different models employ, and how those strategies relate their understanding of language. To understand this better, we run several experiments to characterize how one popular abstractive model, the pointer-generator model of See et al. (2017), uses its explicit copy/generation switch to control its level of abstraction (generation) vs extraction (copying). On an extractive-biased dataset, the model utilizes syntactic boundaries to truncate sentences that are otherwise often copied verbatim. When we modify the copy/generation switch and force the model to generate, only simple paraphrasing abilities are revealed alongside factual inaccuracies and hallucinations. On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.