Improving Performance of Automatic Keyword Extraction (AKE) Methods Using PoS-Tagging and Enhanced Semantic-Awareness
Enes Altuncu, Jason R.C. Nurse, Yang Xu, Jie Guo, Shujun Li

TL;DR
This paper introduces a post-processing approach that enhances automatic keyword extraction methods by incorporating PoS-tagging and semantic information, leading to significant and consistent performance improvements across multiple datasets.
Contribution
It presents a universal, semantic-aware post-processing technique that can be applied to any AKE method to improve its accuracy and robustness.
Findings
Performance improved up to 100% in improved cases
F1-score increased by 10.2% to 53.8% across methods
Significant and consistent improvements demonstrated on 17 datasets
Abstract
Automatic keyword extraction (AKE) has gained more importance with the increasing amount of digital textual data that modern computing systems process. It has various applications in information retrieval (IR) and natural language processing (NLP), including text summarisation, topic analysis and document indexing. This paper proposes a simple but effective post-processing-based universal approach to improve the performance of any AKE methods, via an enhanced level of semantic-awareness supported by PoS-tagging. To demonstrate the performance of the proposed approach, we considered word types retrieved from a PoS-tagging step and two representative sources of semantic information - specialised terms defined in one or more context-dependent thesauri, and named entities in Wikipedia. The above three steps can be simply added to the end of any AKE methods as part of a post-processor, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Service-Oriented Architecture and Web Services · Information Retrieval and Search Behavior
