On the Heterogeneous Distributions in Paper Citations
Jinhyuk Yun, Sejung Ahn, June Young Lee

TL;DR
This study analyzes 21 years of scientific citation data from over 42 million papers, revealing that normalized citation distributions follow a power-law with an exponential cutoff, highlighting the long-term influence of journal reputation.
Contribution
It provides a large-scale, systematic analysis of citation distributions across disciplines, introducing a method to normalize citations by removing journal reputation effects.
Findings
Raw citation counts follow a log-normal distribution, except in the first year.
Normalized citations follow a power-law distribution with an exponential cutoff.
Journal reputation significantly influences long-term citation patterns.
Abstract
Academic papers have been the protagonists in disseminating expertise. Naturally, paper citation pattern analysis is an efficient and essential way of investigating the knowledge structure of science and technology. For decades, it has been observed that citation of scientific literature follows a heterogeneous and heavy-tailed distribution, and many of them suggest a power-law distribution, log-normal distribution, and related distributions. However, many studies are limited to small-scale approaches; therefore, it is hard to generalize. To overcome this problem, we investigate 21 years of citation evolution through a systematic analysis of the entire citation history of 42,423,644 scientific literatures published from 1996 to 2016 and contained in SCOPUS. We tested six candidate distributions for the scientific literature in three distinct levels of Scimago Journal & Country Rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Complex Network Analysis Techniques · Advanced Text Analysis Techniques
