Survey on Abstractive Text Summarization: Dataset, Models, and Metrics
Gospel Ozioma Nnadi, Flavio Bertini

TL;DR
This survey reviews recent advancements in abstractive text summarization, focusing on transformer-based models, datasets, evaluation metrics, and their strengths and limitations in producing human-like summaries.
Contribution
It provides a comprehensive overview of datasets, models, and metrics in abstractive summarization, highlighting recent transformer-based approaches and their performance.
Findings
Transformer models significantly improve summarization quality.
Evaluation metrics reveal strengths and limitations of current models.
Open-source code and datasets facilitate further research.
Abstract
The advancements in deep learning, particularly the introduction of transformers, have been pivotal in enhancing various natural language processing (NLP) tasks. These include text-to-text applications such as machine translation, text classification, and text summarization, as well as data-to-text tasks like response generation and image-to-text tasks such as captioning. Transformer models are distinguished by their attention mechanisms, pretraining on general knowledge, and fine-tuning for downstream tasks. This has led to significant improvements, particularly in abstractive summarization, where sections of a source document are paraphrased to produce summaries that closely resemble human expression. The effectiveness of these models is assessed using diverse metrics, encompassing techniques like semantic overlap and factual correctness. This survey examines the state of the art in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Adam
