PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang, Yao Zhao, Mohammad Saleh, Peter J. Liu

TL;DR
PEGASUS introduces a novel pre-training method for Transformer models using gap-sentence generation, significantly improving abstractive summarization across diverse domains and low-resource settings.
Contribution
The paper proposes a new self-supervised pre-training objective for Transformer models, tailored specifically for abstractive summarization, and demonstrates its effectiveness across multiple datasets.
Findings
Achieves state-of-the-art ROUGE scores on 12 summarization datasets.
Surpasses previous models in low-resource scenarios with only 1000 examples.
Human evaluation confirms the quality of generated summaries.
Abstract
Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks including text summarization. However, pre-training objectives tailored for abstractive text summarization have not been explored. Furthermore there is a lack of systematic evaluation across diverse domains. In this work, we propose pre-training large Transformer-based encoder-decoder models on massive text corpora with a new self-supervised objective. In PEGASUS, important sentences are removed/masked from an input document and are generated together as one output sequence from the remaining sentences, similar to an extractive summary. We evaluated our best PEGASUS model on 12 downstream summarization tasks spanning news, science, stories, instructions, emails, patents, and legislative bills. Experiments demonstrate it achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google/pegasus-cnn_dailymailmodel· 10k dl· ♡ 11010k dl♡ 110
- 🤗ccdv/lsg-pegasus-large-4096model· 13 dl13 dl
- 🤗google/pegasus-aeslcmodel· 18 dl18 dl
- 🤗google/pegasus-arxivmodel· 470 dl· ♡ 2470 dl♡ 2
- 🤗google/pegasus-billsummodel· 134 dl· ♡ 4134 dl♡ 4
- 🤗google/pegasus-gigawordmodel· 25 dl· ♡ 225 dl♡ 2
- 🤗google/pegasus-largemodel· 5.8k dl· ♡ 1045.8k dl♡ 104
- 🤗google/pegasus-multi_newsmodel· 64 dl· ♡ 2664 dl♡ 26
- 🤗google/pegasus-newsroommodel· 34 dl· ♡ 1634 dl♡ 16
- 🤗google/pegasus-pubmedmodel· 413 dl· ♡ 9413 dl♡ 9
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsPEGASUS
