Improving the Robustness of Summarization Models by Detecting and   Removing Input Noise

Kundan Krishna; Yao Zhao; Jie Ren; Balaji Lakshminarayanan; Jiaming; Luo; Mohammad Saleh; Peter J. Liu

arXiv:2212.09928·cs.CL·December 5, 2023

Improving the Robustness of Summarization Models by Detecting and Removing Input Noise

Kundan Krishna, Yao Zhao, Jie Ren, Balaji Lakshminarayanan, Jiaming, Luo, Mohammad Saleh, Peter J. Liu

PDF

Open Access

TL;DR

This paper investigates how input noise affects summarization models' performance and introduces a simple, training-free method to detect and remove noise, significantly improving robustness across datasets.

Contribution

The paper provides a comprehensive empirical analysis of noise impact on summarization and proposes a lightweight, effective noise removal technique without additional training.

Findings

01

Input noise can reduce ROUGE-1 scores by up to 12 points.

02

The proposed noise detection method recovers up to 11 ROUGE-1 points.

03

The approach works across different datasets and model sizes.

Abstract

The evaluation of abstractive summarization models typically uses test data that is identically distributed as training data. In real-world practice, documents to be summarized may contain input noise caused by text extraction artifacts or data pipeline bugs. The robustness of model performance under distribution shift caused by such noise is relatively under-studied. We present a large empirical study quantifying the sometimes severe loss in performance (up to 12 ROUGE-1 points) from different types of input noise for a range of datasets and model sizes. We then propose a light-weight method for detecting and removing such noise in the input during model inference without requiring any extra training, auxiliary models, or even prior knowledge of the type of noise. Our proposed approach effectively mitigates the loss in performance, recovering a large fraction of the performance drop,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies

MethodsTest