Cross-Domain Keyword Extraction with Keyness Patterns

Dongmei Zhou; Xuri Tang

arXiv:2409.18724·cs.IR·September 30, 2024

Cross-Domain Keyword Extraction with Keyness Patterns

Dongmei Zhou, Xuri Tang

PDF

Open Access

TL;DR

This paper introduces a supervised neural network approach for keyword extraction that leverages community-level keyness patterns, demonstrating state-of-the-art results and robustness across multiple domains.

Contribution

It proposes a novel supervised ranking method using convolutional neural networks to learn keyness patterns, addressing domain dependence and annotation subjectivity in keyword extraction.

Findings

01

Achieves an average top-10-F-measure of 0.316 on ten datasets

02

Demonstrates robust cross-domain performance with 0.346 F-measure

03

Outperforms existing methods in supervised keyword extraction

Abstract

Domain dependence and annotation subjectivity pose challenges for supervised keyword extraction. Based on the premises that second-order keyness patterns are existent at the community level and learnable from annotated keyword extraction datasets, this paper proposes a supervised ranking approach to keyword extraction that ranks keywords with keyness patterns consisting of independent features (such as sublanguage domain and term length) and three categories of dependent features -- heuristic features, specificity features, and representavity features. The approach uses two convolutional-neural-network based models to learn keyness patterns from keyword datasets and overcomes annotation subjectivity by training the two models with bootstrap sampling strategy. Experiments demonstrate that the approach not only achieves state-of-the-art performance on ten keyword datasets in general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Information Retrieval and Search Behavior