Improving Sequence-to-Sequence Models for Abstractive Text Summarization Using Meta Heuristic Approaches
Aditya Saxena, Ashutosh Ranjan

TL;DR
This paper explores enhancing sequence-to-sequence models for abstractive text summarization by applying meta-heuristic approaches to optimize hyperparameters and model configurations, tested on CNN/DailyMail dataset.
Contribution
It introduces the use of meta-heuristic techniques to fine-tune seq2seq models for better summarization performance, which is a novel approach in this context.
Findings
Meta-heuristic optimization improves model performance
Fine-tuning hyperparameters enhances summary quality
Experimental results on CNN/DailyMail dataset validate effectiveness
Abstract
As human society transitions into the information age, reduction in our attention span is a contingency, and people who spend time reading lengthy news articles are decreasing rapidly and the need for succinct information is higher than ever before. Therefore, it is essential to provide a quick overview of important news by concisely summarizing the top news article and the most intuitive headline. When humans try to make summaries, they extract the essential information from the source and add useful phrases and grammatical annotations from the original extract. Humans have a unique ability to create abstractions. However, automatic summarization is a complicated problem to solve. The use of sequence-to-sequence (seq2seq) models for neural abstractive text summarization has been ascending as far as prevalence. Numerous innovative strategies have been proposed to develop the current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence
