Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
Shiyue Zhang, David Wan, Mohit Bansal

TL;DR
This paper reveals that extractive summarization can also produce unfaithful summaries with various broad issues, introduces a new detection metric, and emphasizes the need for improved evaluation of extractive methods.
Contribution
It defines a typology of five broad unfaithfulness problems in extractive summarization, and proposes ExtEval, a new metric for detecting these issues, supported by human annotations and experimental validation.
Findings
30% of summaries contain unfaithfulness issues
Existing metrics poorly correlate with human judgments
ExtEval outperforms previous evaluation metrics
Abstract
The problems of unfaithful summaries have been widely discussed under the context of abstractive summarization. Though extractive summarization is less prone to the common unfaithfulness issues of abstractive summaries, does that mean extractive is equal to faithful? Turns out that the answer is no. In this work, we define a typology with five types of broad unfaithfulness problems (including and beyond not-entailment) that can appear in extractive summaries, including incorrect coreference, incomplete coreference, incorrect discourse, incomplete discourse, as well as other misleading information. We ask humans to label these problems out of 1600 English summaries produced by 16 diverse extractive systems. We find that 30% of the summaries have at least one of the five issues. To automatically detect these problems, we find that 5 existing faithfulness evaluation metrics for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
