Centrality-as-Relevance: Support Sets and Similarity as Geometric Proximity
Ricardo Ribeiro, David Martins de Matos

TL;DR
This paper introduces a novel centrality-based summarization method using support sets and geometric proximity to identify the most relevant content, achieving state-of-the-art results across text and speech summarization tasks.
Contribution
It proposes a new relevance model that leverages support sets and geometric proximity to improve extractive summarization, considering whole sources and minor topics.
Findings
Achieves state-of-the-art performance in text summarization.
Effective in speech and written text summarization.
Language- and domain-independent approach.
Abstract
In automatic summarization, centrality-as-relevance means that the most important content of an information source, or a collection of information sources, corresponds to the most central passages, considering a representation where such notion makes sense (graph, spatial, etc.). We assess the main paradigms, and introduce a new centrality-based relevance model for automatic summarization that relies on the use of support sets to better estimate the relevant content. Geometric proximity is used to compute semantic relatedness. Centrality (relevance) is determined by considering the whole input source (and not only local information), and by taking into account the existence of minor topics or lateral subjects in the information sources to be summarized. The method consists in creating, for each passage of the input source, a support set consisting only of the most semantically related…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
