Modelling Citation Networks
S.R. Goldberg, H. Anthony, T.S. Evans

TL;DR
This paper introduces a simple three-parameter citation network model that accurately reproduces the distribution of citation counts over time, highlighting the importance of local and global search processes in citation behaviors.
Contribution
The paper presents a novel three-parameter model incorporating local and global search processes to better simulate citation network distributions over time.
Findings
The distribution of citation counts remains approximately constant year to year.
Around 20% of citations use global information, 80% use local searches.
Fluctuations in bibliography size significantly affect the model's accuracy.
Abstract
The distribution of the number of academic publications as a function of citation count for a given year is remarkably similar from year to year. We measure this similarity as a width of the distribution and find it to be approximately constant from year to year. We show that simple citation models fail to capture this behaviour. We then provide a simple three parameter citation network model using a mixture of local and global search processes which can reproduce the correct distribution over time. We use the citation network of papers from the hep-th section of arXiv to test our model. For this data, around 20% of citations use global information to reference recently published papers, while the remaining 80% are found using local searches. We note that this is consistent with other studies though our motivation is very different from previous work. Finally, we also find that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
