Citation count distributions for large monodisciplinary journals
Mike Thelwall

TL;DR
This study compares statistical models for citation counts in large journals, finding the discretised lognormal distribution fits better than the hooked power law, which impacts how citation impact is analyzed.
Contribution
It provides evidence that the discretised lognormal distribution is more suitable for modeling citation data in individual journals, challenging previous findings at broader levels.
Findings
Discretised lognormal fits better than hooked power law in most cases.
Results challenge previous findings at the subcategory level.
Improved software for fitting the hooked power law is included.
Abstract
Many different citation-based indicators are used by researchers and research evaluators to help evaluate the impact of scholarly outputs. Although the appropriateness of individual citation indicators depends in part on the statistical properties of citation counts, there is no universally agreed best-fitting statistical distribution against which to check them. The two current leading candidates are the discretised lognormal and the hooked or shifted power law. These have been mainly tested on sets of articles from a single field and year but these collections can include multiple specialisms that might dilute their properties. This article fits statistical distributions to 50 large subject-specific journals in the belief that individual journals can be purer than subject categories and may therefore give clearer findings. The results show that in most cases the discretised lognormal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
