Entity-level Factual Consistency of Abstractive Text Summarization

Feng Nan; Ramesh Nallapati; Zhiguo Wang; Cicero Nogueira dos Santos,; Henghui Zhu; Dejiao Zhang; Kathleen McKeown; Bing Xiang

arXiv:2102.09130·cs.CL·February 19, 2021·5 cites

Entity-level Factual Consistency of Abstractive Text Summarization

Feng Nan, Ramesh Nallapati, Zhiguo Wang, Cicero Nogueira dos Santos,, Henghui Zhu, Dejiao Zhang, Kathleen McKeown, Bing Xiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces new metrics for evaluating entity-level factual consistency in abstractive summarization, identifies data filtering as a solution to entity hallucination, and proposes joint entity-summary generation methods for improved accuracy.

Contribution

It presents novel metrics for entity-level factuality, demonstrates data filtering reduces hallucination, and introduces joint entity and summary generation techniques.

Findings

01

New metrics effectively measure entity-level factual consistency.

02

Filtering training data reduces entity hallucination.

03

Joint entity-summary generation improves factual accuracy.

Abstract

A key challenge for abstractive summarization is ensuring factual consistency of the generated summary with respect to the original document. For example, state-of-the-art models trained on existing datasets exhibit entity hallucination, generating names of entities that are not present in the source document. We propose a set of new metrics to quantify the entity-level factual consistency of generated summaries and we show that the entity hallucination problem can be alleviated by simply filtering the training data. In addition, we propose a summary-worthy entity classification task to the training process as well as a joint entity and summary generation approach, which yield further improvements in entity level metrics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-research/fact-check-summarization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques