Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach
Rajeev Bhatt Ambati, Saptarashmi Bandyopadhyay, Prasenjit Mitra

TL;DR
This paper introduces a hierarchical neural semantic encoder-based approach for text summarization that effectively captures long-term dependencies and outperforms previous models by nearly 4 ROUGE points, utilizing linguistic factors and reinforcement learning.
Contribution
It proposes a novel hierarchical Neural Semantic Encoder model with linguistic factor augmentation and a GPU-based reinforcement learning method for improved summarization.
Findings
Hierarchical NSE outperforms previous models by nearly 4 ROUGE points.
Augmenting linguistic factors improves vocabulary coverage and generalization.
The GPU-based reinforcement learning enhances summary quality.
Abstract
Traditional sequence-to-sequence (seq2seq) models and other variations of the attention-mechanism such as hierarchical attention have been applied to the text summarization problem. Though there is a hierarchy in the way humans use language by forming paragraphs from sentences and sentences from words, hierarchical models have usually not worked that much better than their traditional seq2seq counterparts. This effect is mainly because either the hierarchical attention mechanisms are too sparse using hard attention or noisy using soft attention. In this paper, we propose a method based on extracting the highlights of a document; a key concept that is conveyed in a few sentences. In a typical text summarization dataset consisting of documents that are 800 tokens in length (average), capturing long-term dependencies is very important, e.g., the last sentence can be grouped with the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
