Topic-Aware Encoding for Extractive Summarization
Mingyang Song, Liping Jing

TL;DR
This paper introduces a topic-aware encoding method for extractive summarization that incorporates central topic information into sentence representations, improving the quality of summaries especially for long documents.
Contribution
It proposes a novel model combining syntactic and topic-level information with neural topic modeling to enhance extractive summarization performance.
Findings
Outperforms state-of-the-art models on three datasets
Effectively captures central topics for better summary relevance
Improves summarization quality for lengthy documents
Abstract
Document summarization provides an instrument for faster understanding the collection of text documents and has several real-life applications. With the growth of online text data, numerous summarization models have been proposed recently. The Sequence-to-Sequence (Seq2Seq) based neural summarization model is the most widely used in the summarization field due to its high performance. This is because semantic information and structure information in the text is adequately considered when encoding. However, the existing extractive summarization models pay little attention to and use the central topic information to assist the generation of summaries, which leads to models not ensuring the generated summary under the primary topic. A lengthy document can span several topics, and a single summary cannot do justice to all the topics. Therefore, the key to generating a high-quality summary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
