Counterfactual Data Augmentation improves Factuality of Abstractive   Summarization

Dheeraj Rajagopal; Siamak Shakeri; Cicero Nogueira dos Santos; Eduard; Hovy; Chung-Ching Chang

arXiv:2205.12416·cs.CL·May 26, 2022·6 cites

Counterfactual Data Augmentation improves Factuality of Abstractive Summarization

Dheeraj Rajagopal, Siamak Shakeri, Cicero Nogueira dos Santos, Eduard, Hovy, Chung-Ching Chang

PDF

Open Access

TL;DR

This paper introduces counterfactual data augmentation techniques that enhance the factual accuracy of abstractive summarization models without compromising their overall quality, demonstrated on popular datasets.

Contribution

The authors propose three novel augmentation methods using entity replacement and hypernym substitution to improve factual correctness in summarization.

Findings

01

Factual correctness improved by about 2.5 points on CNN/Dailymail and XSum datasets.

02

Augmentation does not significantly affect ROUGE scores.

03

Methods increase training data diversity and factual consistency.

Abstract

Abstractive summarization systems based on pretrained language models often generate coherent but factually inconsistent sentences. In this paper, we present a counterfactual data augmentation approach where we augment data with perturbed summaries that increase the training data diversity. Specifically, we present three augmentation approaches based on replacing (i) entities from other and the same category and (ii) nouns with their corresponding WordNet hypernyms. We show that augmenting the training data with our approach improves the factual correctness of summaries without significantly affecting the ROUGE score. We show that in two commonly used summarization datasets (CNN/Dailymail and XSum), we improve the factual correctness by about 2.5 points on average

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques