Improving Sequence-to-Sequence Models for Abstractive Text Summarization   Using Meta Heuristic Approaches

Aditya Saxena; Ashutosh Ranjan

arXiv:2403.16247·cs.CL·March 26, 2024·1 cites

Improving Sequence-to-Sequence Models for Abstractive Text Summarization Using Meta Heuristic Approaches

Aditya Saxena, Ashutosh Ranjan

PDF

Open Access

TL;DR

This paper explores enhancing sequence-to-sequence models for abstractive text summarization by applying meta-heuristic approaches to optimize hyperparameters and model configurations, tested on CNN/DailyMail dataset.

Contribution

It introduces the use of meta-heuristic techniques to fine-tune seq2seq models for better summarization performance, which is a novel approach in this context.

Findings

01

Meta-heuristic optimization improves model performance

02

Fine-tuning hyperparameters enhances summary quality

03

Experimental results on CNN/DailyMail dataset validate effectiveness

Abstract

As human society transitions into the information age, reduction in our attention span is a contingency, and people who spend time reading lengthy news articles are decreasing rapidly and the need for succinct information is higher than ever before. Therefore, it is essential to provide a quick overview of important news by concisely summarizing the top news article and the most intuitive headline. When humans try to make summaries, they extract the essential information from the source and add useful phrases and grammatical annotations from the original extract. Humans have a unique ability to create abstractions. However, automatic summarization is a complicated problem to solve. The use of sequence-to-sequence (seq2seq) models for neural abstractive text summarization has been ascending as far as prevalence. Numerous innovative strategies have been proposed to develop the current…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence