Local Word Vectors Guiding Keyphrase Extraction

Eirini Papagiannopoulou; Grigorios Tsoumakas

arXiv:1710.07503·cs.CL·April 16, 2018

Local Word Vectors Guiding Keyphrase Extraction

Eirini Papagiannopoulou, Grigorios Tsoumakas

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised keyphrase extraction method that leverages local word embeddings trained on individual documents, improving accuracy over traditional global embeddings and existing methods.

Contribution

It proposes a novel approach using document-specific GloVe embeddings for keyphrase extraction, enhancing semantic relevance and extraction quality.

Findings

01

Local embeddings outperform global embeddings in keyphrase extraction

02

The method surpasses state-of-the-art unsupervised approaches

03

Empirical results confirm improved extraction accuracy

Abstract

Automated keyphrase extraction is a fundamental textual information processing task concerned with the selection of representative phrases from a document that summarize its content. This work presents a novel unsupervised method for keyphrase extraction, whose main innovation is the use of local word embeddings (in particular GloVe vectors), i.e., embeddings trained from the single document under consideration. We argue that such local representation of words and keyphrases are able to accurately capture their semantics in the context of the document they are part of, and therefore can help in improving keyphrase extraction quality. Empirical results offer evidence that indeed local representations lead to better keyphrase extraction results compared to both embeddings trained on very large third corpora or larger corpora consisting of several documents of the same scientific field and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

epapagia/RVA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques

MethodsGloVe Embeddings