Improving the Robustness of Summarization Systems with Dual Augmentation

Xiuying Chen; Guodong Long; Chongyang Tao; Mingzhe Li; Xin Gao,; Chengqi Zhang; Xiangliang Zhang

arXiv:2306.01090·cs.CL·June 5, 2023·1 cites

Improving the Robustness of Summarization Systems with Dual Augmentation

Xiuying Chen, Guodong Long, Chongyang Tao, Mingzhe Li, Xin Gao,, Chengqi Zhang, Xiangliang Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the robustness of summarization models against input perturbations and noise, proposing dual data augmentation techniques to enhance their resilience and improve performance on noisy and adversarial datasets.

Contribution

It introduces SummAttacker for generating semantic-consistent adversarial samples and a manifold softmixing method to augment training data, significantly boosting robustness of summarization models.

Findings

01

Models' performance drops significantly on adversarial inputs.

02

Data augmentation improves robustness on noisy and attacked datasets.

03

Proposed methods outperform strong baselines on Gigaword and CNN/DM datasets.

Abstract

A robust summarization system should be able to capture the gist of the document, regardless of the specific word choices or noise in the input. In this work, we first explore the summarization models' robustness against perturbations including word-level synonym substitution and noise. To create semantic-consistent substitutes, we propose a SummAttacker, which is an efficient approach to generating adversarial samples based on language models. Experimental results show that state-of-the-art summarization models have a significant decrease in performance on adversarial and noisy test sets. Next, we analyze the vulnerability of the summarization systems and explore improving the robustness by data augmentation. Specifically, the first brittleness factor we found is the poor understanding of infrequent words in the input. Correspondingly, we feed the encoder with more diverse cases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iriscxy/robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Hate Speech and Cyberbullying Detection

MethodsTest