Improving the Faithfulness of Abstractive Summarization via Entity   Coverage Control

Haopeng Zhang; Semih Yavuz; Wojciech Kryscinski; Kazuma Hashimoto,; Yingbo Zhou

arXiv:2207.02263·cs.CL·July 7, 2022

Improving the Faithfulness of Abstractive Summarization via Entity Coverage Control

Haopeng Zhang, Semih Yavuz, Wojciech Kryscinski, Kazuma Hashimoto,, Yingbo Zhou

PDF

Open Access

TL;DR

This paper introduces Entity Coverage Control (ECC), a method to improve faithfulness in abstractive summarization by guiding models to better recognize and include entities, reducing hallucinations in both supervised and zero-shot settings.

Contribution

The paper proposes ECC, a novel entity-level control technique that enhances faithfulness in summarization models, including a zero-shot extension using intermediate fine-tuning on noisy Wikipedia data.

Findings

01

ECC improves faithfulness and salience in summarization.

02

The method outperforms baselines on XSum, Pubmed, and SAMSum datasets.

03

ECC reduces entity hallucinations in both supervised and zero-shot scenarios.

Abstract

Abstractive summarization systems leveraging pre-training language models have achieved superior results on benchmark datasets. However, such models have been shown to be more prone to hallucinate facts that are unfaithful to the input context. In this paper, we propose a method to remedy entity-level extrinsic hallucinations with Entity Coverage Control (ECC). We first compute entity coverage precision and prepend the corresponding control code for each training example, which implicitly guides the model to recognize faithfulness contents in the training phase. We further extend our method via intermediate fine-tuning on large but noisy data extracted from Wikipedia to unlock zero-shot summarization. We show that the proposed method leads to more faithful and salient abstractive summarization in supervised fine-tuning and zero-shot settings according to our experimental results on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques