CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural   Summarization Systems

Yiran Chen; Pengfei Liu; Ming Zhong; Zi-Yi Dou; Danqing Wang; Xipeng; Qiu; Xuanjing Huang

arXiv:2010.05139·cs.CL·October 23, 2020·5 cites

CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems

Yiran Chen, Pengfei Liu, Ming Zhong, Zi-Yi Dou, Danqing Wang, Xipeng, Qiu, Xuanjing Huang

PDF

Open Access 2 Repos

TL;DR

This paper investigates how neural summarization models trained on one dataset perform on different out-of-domain datasets, revealing insights into their generalization capabilities and limitations across various architectures and methods.

Contribution

It provides a comprehensive cross-dataset evaluation of 11 summarization systems, highlighting factors affecting their generalization and exposing existing limitations.

Findings

01

Model architecture influences generalization ability.

02

Abstractive and extractive methods perform differently across datasets.

03

Current models have notable limitations in out-of-domain settings.

Abstract

Neural network-based models augmented with unsupervised pre-trained knowledge have achieved impressive performance on text summarization. However, most existing evaluation methods are limited to an in-domain setting, where summarizers are trained and evaluated on the same dataset. We argue that this approach can narrow our understanding of the generalization ability for different summarization systems. In this paper, we perform an in-depth analysis of characteristics of different datasets and investigate the performance of different summarization models under a cross-dataset setting, in which a summarizer trained on one corpus will be evaluated on a range of out-of-domain corpora. A comprehensive study of 11 representative summarization systems on 5 datasets from different domains reveals the effect of model architectures and generation ways (i.e. abstractive and extractive) on model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques