Topic-Aware Encoding for Extractive Summarization

Mingyang Song; Liping Jing

arXiv:2112.09572·cs.CL·April 1, 2022

Topic-Aware Encoding for Extractive Summarization

Mingyang Song, Liping Jing

PDF

Open Access

TL;DR

This paper introduces a topic-aware encoding method for extractive summarization that incorporates central topic information into sentence representations, improving the quality of summaries especially for long documents.

Contribution

It proposes a novel model combining syntactic and topic-level information with neural topic modeling to enhance extractive summarization performance.

Findings

01

Outperforms state-of-the-art models on three datasets

02

Effectively captures central topics for better summary relevance

03

Improves summarization quality for lengthy documents

Abstract

Document summarization provides an instrument for faster understanding the collection of text documents and has several real-life applications. With the growth of online text data, numerous summarization models have been proposed recently. The Sequence-to-Sequence (Seq2Seq) based neural summarization model is the most widely used in the summarization field due to its high performance. This is because semantic information and structure information in the text is adequately considered when encoding. However, the existing extractive summarization models pay little attention to and use the central topic information to assist the generation of summaries, which leads to models not ensuring the generated summary under the primary topic. A lengthy document can span several topics, and a single summary cannot do justice to all the topics. Therefore, the key to generating a high-quality summary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques