Read, Highlight and Summarize: A Hierarchical Neural Semantic   Encoder-based Approach

Rajeev Bhatt Ambati; Saptarashmi Bandyopadhyay; Prasenjit Mitra

arXiv:1910.03177·cs.LG·November 4, 2019

Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach

Rajeev Bhatt Ambati, Saptarashmi Bandyopadhyay, Prasenjit Mitra

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical neural semantic encoder-based approach for text summarization that effectively captures long-term dependencies and outperforms previous models by nearly 4 ROUGE points, utilizing linguistic factors and reinforcement learning.

Contribution

It proposes a novel hierarchical Neural Semantic Encoder model with linguistic factor augmentation and a GPU-based reinforcement learning method for improved summarization.

Findings

01

Hierarchical NSE outperforms previous models by nearly 4 ROUGE points.

02

Augmenting linguistic factors improves vocabulary coverage and generalization.

03

The GPU-based reinforcement learning enhances summary quality.

Abstract

Traditional sequence-to-sequence (seq2seq) models and other variations of the attention-mechanism such as hierarchical attention have been applied to the text summarization problem. Though there is a hierarchy in the way humans use language by forming paragraphs from sentences and sentences from words, hierarchical models have usually not worked that much better than their traditional seq2seq counterparts. This effect is mainly because either the hierarchical attention mechanisms are too sparse using hard attention or noisy using soft attention. In this paper, we propose a method based on extracting the highlights of a document; a key concept that is conveyed in a few sentences. In a typical text summarization dataset consisting of documents that are 800 tokens in length (average), capturing long-term dependencies is very important, e.g., the last sentence can be grouped with the first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rajeev595/RHS_HierNSE
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence