Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation
Alexander R. Fabbri, Simeng Han, Haoyuan Li, Haoran Li, Marjan, Ghazvininejad, Shafiq Joty, Dragomir Radev, Yashar Mehdad

TL;DR
This paper introduces WikiTransfer, a novel unsupervised fine-tuning method for pretrained summarization models that improves zero and few-shot performance across diverse datasets by using pseudo-summaries from Wikipedia and data augmentation techniques.
Contribution
The paper presents WikiTransfer, a new unsupervised fine-tuning approach that enhances zero and few-shot summarization, with detailed analysis of dataset components and augmentation effects.
Findings
State-of-the-art zero-shot performance on CNN-DailyMail.
Effective few-shot transfer with only 10-100 examples.
Robustness to noisy data and improved summary quality.
Abstract
Models pretrained with self-supervised objectives on large text corpora achieve state-of-the-art performance on English text summarization tasks. However, these models are typically fine-tuned on hundreds of thousands of data points, an infeasible requirement when applying summarization to new, niche domains. In this work, we introduce a novel and generalizable method, called WikiTransfer, for fine-tuning pretrained models for summarization in an unsupervised, dataset-specific manner. WikiTransfer fine-tunes pretrained models on pseudo-summaries, produced from generic Wikipedia data, which contain characteristics of the target dataset, such as the length and level of abstraction of the desired summaries. WikiTransfer models achieve state-of-the-art, zero-shot abstractive summarization performance on the CNN-DailyMail dataset and demonstrate the effectiveness of our approach on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
