SumHiS: Extractive Summarization Exploiting Hidden Structure

Tikhonov Pavel; Anastasiya Ianina; Valentin Malykh

arXiv:2406.08215·cs.CL·June 13, 2024

SumHiS: Extractive Summarization Exploiting Hidden Structure

Tikhonov Pavel, Anastasiya Ianina, Valentin Malykh

PDF

Open Access

TL;DR

SumHiS introduces a novel extractive summarization method leveraging hidden text structures, achieving state-of-the-art ROUGE-2 scores on CNN/DailyMail by interpreting text as aspects.

Contribution

The paper presents a new extractive summarization approach that exploits hidden clustering structures in text, improving summary accuracy over existing methods.

Findings

01

Outperforms previous methods on CNN/DailyMail with 10% higher ROUGE-2 score

02

Effectively interprets hidden text structures as aspects

03

Achieves state-of-the-art summarization results

Abstract

Extractive summarization is a task of highlighting the most important parts of the text. We introduce a new approach to extractive summarization task using hidden clustering structure of the text. Experimental results on CNN/DailyMail demonstrate that our approach generates more accurate summaries than both extractive and abstractive methods, achieving state-of-the-art results in terms of ROUGE-2 metric exceeding the previous approaches by 10%. Additionally, we show that hidden structure of the text could be interpreted as aspects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Data Mining Algorithms and Applications · Web Data Mining and Analysis