A New Sentence Extraction Strategy for Unsupervised Extractive   Summarization Methods

Dehao Tao; Yingzhu Xiong; Zhongliang Yang; and Yongfeng Huang

arXiv:2112.03203·cs.CL·January 25, 2024·1 cites

A New Sentence Extraction Strategy for Unsupervised Extractive Summarization Methods

Dehao Tao, Yingzhu Xiong, Zhongliang Yang, and Yongfeng Huang

PDF

Open Access

TL;DR

This paper introduces a novel unsupervised sentence extraction strategy for extractive text summarization, leveraging Information Theory to improve feature distribution and reduce mutual information among sentences, showing effectiveness across datasets.

Contribution

The paper proposes a new sentence extraction strategy based on Information Theory, applicable to existing unsupervised methods, enhancing their performance without large datasets.

Findings

01

Strategy improves summarization quality

02

Effective across multiple datasets

03

Reduces mutual information among sentences

Abstract

In recent years, text summarization methods have attracted much attention again thanks to the researches on neural network models. Most of the current text summarization methods based on neural network models are supervised methods which need large-scale datasets. However, large-scale datasets are difficult to obtain in practical applications. In this paper, we model the task of extractive text summarization methods from the perspective of Information Theory, and then describe the unsupervised extractive methods with a uniform framework. To improve the feature distribution and to decrease the mutual information of summarization sentences, we propose a new sentence extraction strategy which can be applied to existing unsupervised extractive methods. Experiments are carried out on different datasets, and results show that our strategy is indeed effective and in line with expectations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques