Unsupervised Extractive Summarization by Human Memory Simulation

Ronald Cardenas; Matthias Galle; Shay B. Cohen

arXiv:2104.08392·cs.CL·April 20, 2021

Unsupervised Extractive Summarization by Human Memory Simulation

Ronald Cardenas, Matthias Galle, Shay B. Cohen

PDF

Open Access

TL;DR

This paper proposes an unsupervised extractive summarization method that uses heuristics inspired by human memory models to identify important content in long, structured documents, improving relevance and summary quality.

Contribution

It introduces novel heuristics based on cognitive representations of memory for content selection in unsupervised extractive summarization of scientific articles.

Findings

01

Heuristics effectively leverage cognitive structures and document organization.

02

Proposed methods outperform baselines in automatic evaluations.

03

Human assessments confirm improved relevance of extracted content.

Abstract

Summarization systems face the core challenge of identifying and selecting important information. In this paper, we tackle the problem of content selection in unsupervised extractive summarization of long, structured documents. We introduce a wide range of heuristics that leverage cognitive representations of content units and how these are retained or forgotten in human memory. We find that properties of these representations of human memory can be exploited to capture relevance of content units in scientific articles. Experiments show that our proposed heuristics are effective at leveraging cognitive structures and the organization of the document (i.e.\ sections of an article), and automatic and human evaluations provide strong evidence that these heuristics extract more summary-worthy content units.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques