Periodical embeddings uncover hidden interdisciplinary patterns in the subject classification scheme of science
Zhuoqi Lyu, Qing Ke

TL;DR
This paper introduces a new citation-based embedding method that improves journal classification accuracy and reveals hidden interdisciplinary patterns, surpassing traditional taxonomies in reflecting the structure of scientific knowledge.
Contribution
The study presents a novel periodical embedding framework using citation data, demonstrating its superiority over existing classification schemes in capturing scientific interdisciplinarity.
Findings
Citation-based embeddings outperform traditional taxonomies in classification tasks.
Disciplinary boundaries are reorganized, splitting broad categories and merging fragmented ones.
Interdisciplinary clusters are identified as coherent, dispersed groups in citation space.
Abstract
Subject classification schemes are foundational to the organization, evaluation, and navigation of scientific knowledge. While expert-curated systems like Scopus provide widely used taxonomies, they often suffer from coarse granularity, subjectivity, and limited adaptability to emerging interdisciplinary fields. Data-driven alternatives based on citation networks show promise but lack rigorous, external validation against the semantic content of scientific literature. Here, we propose a novel quantitative framework that leverages classification tasks to evaluate the effectiveness of journal classification schemes. Using over 23 million paper abstracts, we demonstrate that labels derived from k-means clustering on Periodical2Vec (P2V)--a periodical embedding learned from paper-level citations--yield significantly higher classification performance than both Scopus and other data-driven…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Computational and Text Analysis Methods · Advanced Graph Neural Networks
