TL;DR
MUDY introduces a novel framework for unsupervised keyphrase extraction that captures multi-granular contextual salience using prompt-based and self-attention scoring methods, outperforming existing approaches.
Contribution
The paper proposes MUDY, a new context-centric method combining prompt-based and self-attention scoring to improve keyphrase extraction by capturing local and global contextual importance.
Findings
MUDY outperforms state-of-the-art baselines on four datasets.
It effectively captures multi-granular contextual salience.
Quantitative and qualitative analyses validate its efficacy.
Abstract
Keyphrase extraction aims to automatically identify concise phrases that effectively represent the content of a document. While recent methods leveraging pre-trained language models (PLMs) have significantly improved the extraction of keyphrases with strong global semantic relevance, they often fall short in capturing the local contextual importance of keyphrases tied to specific subtopics dispersed in a document. In this paper, we propose a novel context-centric framework, MUDY, that effectively captures multi-granular contextual salience of candidate keyphrases. MUDY employs two complementary components: (1) a prompt-based scoring that estimates the generation likelihood of each candidate keyphrase, augmented with candidate-aware weighting to better reflect its local contextual importance, and (2) a self-attention-based scoring that utilizes multi-granular attention patterns from PLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
