Study of keyword extraction techniques for Electric Double Layer   Capacitor domain using text similarity indexes: An experimental analysis

M. Saef Ullah Miah; Junaida Sulaiman; Talha Bin Sarwar; Kamal Z. Zamli; and Rajan Jose

arXiv:2111.07068·cs.IR·May 2, 2022

Study of keyword extraction techniques for Electric Double Layer Capacitor domain using text similarity indexes: An experimental analysis

M. Saef Ullah Miah, Junaida Sulaiman, Talha Bin Sarwar, Kamal Z. Zamli, and Rajan Jose

PDF

1 Repo

TL;DR

This paper evaluates various automated keyword extraction algorithms in the Electric Double Layer Capacitor domain, comparing their similarity to expert keywords using multiple similarity indexes, and identifies the most effective method.

Contribution

It provides an experimental comparison of supervised and unsupervised keyword extraction techniques and assesses their similarity to expert keywords in the EDLC domain.

Findings

01

MultipartiteRank with cosine word vector similarity achieves 92% similarity.

02

Unsupervised algorithms like YAKE and TopicRank also perform well.

03

The study guides NLP researchers in selecting suitable keyword extraction methods for EDLC.

Abstract

Keywords perform a significant role in selecting various topic-related documents quite easily. Topics or keywords assigned by humans or experts provide accurate information. However, this practice is quite expensive in terms of resources and time management. Hence, it is more satisfying to utilize automated keyword extraction techniques. Nevertheless, before beginning the automated process, it is necessary to check and confirm how similar expert-provided and algorithm-generated keywords are. This paper presents an experimental analysis of similarity scores of keywords generated by different supervised and unsupervised automated keyword extraction algorithms with expert provided keywords from the Electric Double Layer Capacitor (EDLC) domain. The paper also analyses which texts provide better keywords like positive sentences or all sentences of the document. From the unsupervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ping543f/kwd-extraction-study
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsElectric