A Survey of Recent Abstract Summarization Techniques
Diyah Puspitaningrum

TL;DR
This survey reviews recent abstract summarization techniques like T5, Pegasus, and ProphetNet, analyzing their performance across English and Indonesian datasets, and offers suggestions for improving cross-lingual summarization.
Contribution
It provides a comparative analysis of multiple pre-trained models for summarization in two languages and discusses key factors affecting ROUGE scores.
Findings
T5-Large, Pegasus-XSum, and ProphetNet-CNNDM achieve the best results.
Coverage, density, and compression significantly influence ROUGE performance.
Pre-training dataset size and quality are crucial for cross-lingual summarization.
Abstract
This paper surveys several recent abstract summarization methods: T5, Pegasus, and ProphetNet. We implement the systems in two languages: English and Indonesian languages. We investigate the impact of pre-training models (one T5, three Pegasuses, three ProphetNets) on several Wikipedia datasets in English and Indonesian language and compare the results to the Wikipedia systems' summaries. The T5-Large, the Pegasus-XSum, and the ProphetNet-CNNDM provide the best summarization. The most significant factors that influence ROUGE performance are coverage, density, and compression. The higher the scores, the better the summary. Other factors that influence the ROUGE scores are the pre-training goal, the dataset's characteristics, the dataset used for testing the pre-trained model, and the cross-lingual function. Several suggestions to improve this paper's limitation are: 1) assure that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · ProphetNet · Gated Linear Unit · Adafactor · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Inverse Square Root Schedule · Residual Connection
