Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi, Yaser Keneshloo, Naren Ramakrishnan, Chandan K. Reddy

TL;DR
This paper surveys neural seq2seq models for abstractive text summarization, discusses their techniques, and introduces an open-source toolkit with experiments and benchmarks on multiple datasets.
Contribution
It provides a comprehensive review of seq2seq models for summarization, introduces the NATS toolkit, and benchmarks models on new datasets.
Findings
Different neural components improve summarization quality
The NATS toolkit facilitates model development and evaluation
Benchmark results highlight model strengths and weaknesses
Abstract
In the past few years, neural abstractive text summarization with sequence-to-sequence (seq2seq) models have gained a lot of popularity. Many interesting techniques have been proposed to improve seq2seq models, making them capable of handling different challenges, such as saliency, fluency and human readability, and generate high-quality summaries. Generally speaking, most of these techniques differ in one of these three categories: network structure, parameter inference, and decoding/generation. There are also other concerns, such as efficiency and parallelism for training a model. In this paper, we provide a comprehensive literature survey on different seq2seq models for abstractive text summarization from the viewpoint of network structures, training strategies, and summary generation algorithms. Several models were first proposed for language modeling and generation tasks, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
