Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
Heng Zhang, Chengzhi Zhang

TL;DR
This paper presents a semi-automated approach to constructing a detailed and updatable methodology taxonomy for information science by leveraging full-text academic articles and clustering techniques.
Contribution
It introduces a semi-automated method that expands and updates methodology taxonomies using full-text data and clustering, improving detail and renewal speed.
Findings
Created a multi-level methodology taxonomy for information science.
Enhanced taxonomy detail through clustering of research entities.
Accelerated taxonomy updating process.
Abstract
Research on the construction of traditional information science methodology taxonomy is mostly conducted manually. From the limited corpus, researchers have attempted to summarize some of the research methodology entities into several abstract levels (generally three levels); however, they have been unable to provide a more granular hierarchy. Moreover, updating the methodology taxonomy is traditionally a slow process. In this study, we collected full-text academic papers related to information science. First, we constructed a basic methodology taxonomy with three levels by manual annotation. Then, the word vectors of the research methodology entities were trained using the full-text data. Accordingly, the research methodology entities were clustered and the basic methodology taxonomy was expanded using the clustering results to obtain a methodology taxonomy with more levels. This study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Natural Language Processing Techniques
