Survey on Abstractive Text Summarization: Dataset, Models, and Metrics

Gospel Ozioma Nnadi; Flavio Bertini

arXiv:2412.17165·cs.AI·December 24, 2024·2 cites

Survey on Abstractive Text Summarization: Dataset, Models, and Metrics

Gospel Ozioma Nnadi, Flavio Bertini

PDF

Open Access 2 Repos

TL;DR

This survey reviews recent advancements in abstractive text summarization, focusing on transformer-based models, datasets, evaluation metrics, and their strengths and limitations in producing human-like summaries.

Contribution

It provides a comprehensive overview of datasets, models, and metrics in abstractive summarization, highlighting recent transformer-based approaches and their performance.

Findings

01

Transformer models significantly improve summarization quality.

02

Evaluation metrics reveal strengths and limitations of current models.

03

Open-source code and datasets facilitate further research.

Abstract

The advancements in deep learning, particularly the introduction of transformers, have been pivotal in enhancing various natural language processing (NLP) tasks. These include text-to-text applications such as machine translation, text classification, and text summarization, as well as data-to-text tasks like response generation and image-to-text tasks such as captioning. Transformer models are distinguished by their attention mechanisms, pretraining on general knowledge, and fine-tuning for downstream tasks. This has led to significant improvements, particularly in abstractive summarization, where sections of a source document are paraphrased to produce summaries that closely resemble human expression. The effectiveness of these models is assessed using diverse metrics, encompassing techniques like semantic overlap and factual correctness. This survey examines the state of the art in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Adam