Improving Factual Consistency in Summarization with Compression-Based   Post-Editing

Alexander R. Fabbri; Prafulla Kumar Choubey; Jesse Vig; Chien-Sheng; Wu; Caiming Xiong

arXiv:2211.06196·cs.CL·November 14, 2022·1 cites

Improving Factual Consistency in Summarization with Compression-Based Post-Editing

Alexander R. Fabbri, Prafulla Kumar Choubey, Jesse Vig, Chien-Sheng, Wu, Caiming Xiong

PDF

Open Access 1 Repo

TL;DR

This paper introduces a compression-based post-editing method to enhance factual consistency in summarization by removing extrinsic entity errors, achieving significant improvements without sacrificing summary quality.

Contribution

It proposes a novel training approach using sentence compression data to effectively remove extrinsic entity errors in summaries, improving factual accuracy.

Findings

01

Entity precision improved by up to 30% on XSum.

02

Combining with other post-editors increases precision by 38%.

03

Model maintains ROUGE scores while enhancing factual consistency.

Abstract

State-of-the-art summarization models still struggle to be factually consistent with the input text. A model-agnostic way to address this problem is post-editing the generated summaries. However, existing approaches typically fail to remove entity errors if a suitable input entity replacement is not available or may insert erroneous content. In our work, we focus on removing extrinsic entity errors, or entities not in the source, to improve consistency while retaining the summary's essential information and form. We propose to use sentence-compression data to train the post-editing model to take a summary with extrinsic entity errors marked with special tokens and output a compressed, well-formed summary with those errors removed. We show that this model improves factual consistency while maintaining ROUGE, improving entity precision by up to 30% on XSum, and that this model can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salesforce/compedit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies

Methodsfail