Abstractive and mixed summarization for long-single documents

Roger Barrull; Jugal Kalita

arXiv:2007.01918·cs.CL·July 7, 2020

Abstractive and mixed summarization for long-single documents

Roger Barrull, Jugal Kalita

PDF

Open Access

TL;DR

This paper explores abstractive and mixed summarization techniques for long scientific documents, demonstrating that hierarchical encoder models outperform others in capturing document structure.

Contribution

It introduces a comparative analysis of six models on scientific papers, highlighting the effectiveness of hierarchical encoders for long document summarization.

Findings

01

Hierarchical encoder models outperform other architectures.

02

Transformer-based models with reinforcement learning show improved results.

03

Long scientific papers can be effectively summarized using these models.

Abstract

The lack of diversity in the datasets available for automatic summarization of documents has meant that the vast majority of neural models for automatic summarization have been trained with news articles. These datasets are relatively small, with an average size of about 600 words, and the models trained with such data sets see their performance limited to short documents. In order to surmount this problem, this paper uses scientific papers as the dataset on which different models are trained. These models have been chosen based on their performance on the CNN/Daily Mail data set, so that the highest ranked model of each architectural variant is selected. In this work, six different models are compared, two with an RNN architecture, one with a CNN architecture, two with a Transformer architecture and one with a Transformer architecture combined with reinforcement learning. The results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax · Dense Connections