A multiple k-means cluster ensemble framework for clustering citation trajectories
Joyita Chakraborty, Dinesh K. Pradhan, Subrata Nandi

TL;DR
This paper introduces a feature-based multiple k-means clustering ensemble to analyze citation trajectories, revealing diverse patterns of article impact over time, with implications for understanding knowledge diffusion and impact prediction.
Contribution
It presents a generalized, parameter-independent clustering framework that captures all trajectory types, overcoming limitations of prior threshold-based methods.
Findings
Identified four distinct citation trajectory patterns.
Achieved linear runtime for large datasets.
Most articles follow specific impact trajectories.
Abstract
Citation maturity time varies for different articles. However, the impact of all articles is measured in a fixed window. Clustering their citation trajectories helps understand the knowledge diffusion process and reveals that not all articles gain immediate success after publication. Moreover, clustering trajectories is necessary for paper impact recommendation algorithms. It is a challenging problem because citation time series exhibit significant variability due to non linear and non stationary characteristics. Prior works propose a set of arbitrary thresholds and a fixed rule based approach. All methods are primarily parameter dependent. Consequently, it leads to inconsistencies while defining similar trajectories and ambiguities regarding their specific number. Most studies only capture extreme trajectories. Thus, a generalised clustering framework is required. This paper proposes a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Text Analysis Techniques · Bioinformatics and Genomic Networks
MethodsDiffusion · Network On Network
