Cross-Domain Keyword Extraction with Keyness Patterns
Dongmei Zhou, Xuri Tang

TL;DR
This paper introduces a supervised neural network approach for keyword extraction that leverages community-level keyness patterns, demonstrating state-of-the-art results and robustness across multiple domains.
Contribution
It proposes a novel supervised ranking method using convolutional neural networks to learn keyness patterns, addressing domain dependence and annotation subjectivity in keyword extraction.
Findings
Achieves an average top-10-F-measure of 0.316 on ten datasets
Demonstrates robust cross-domain performance with 0.346 F-measure
Outperforms existing methods in supervised keyword extraction
Abstract
Domain dependence and annotation subjectivity pose challenges for supervised keyword extraction. Based on the premises that second-order keyness patterns are existent at the community level and learnable from annotated keyword extraction datasets, this paper proposes a supervised ranking approach to keyword extraction that ranks keywords with keyness patterns consisting of independent features (such as sublanguage domain and term length) and three categories of dependent features -- heuristic features, specificity features, and representavity features. The approach uses two convolutional-neural-network based models to learn keyness patterns from keyword datasets and overcomes annotation subjectivity by training the two models with bootstrap sampling strategy. Experiments demonstrate that the approach not only achieves state-of-the-art performance on ten keyword datasets in general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Information Retrieval and Search Behavior
