Density, asymmetry and citation dynamics in scientific literature
Nathaniel Imel, Zachary Hafen

TL;DR
This study explores how the semantic similarity and local density of scientific papers relate to their future citation counts across multiple disciplines, introducing new metrics and predictive models.
Contribution
It introduces two novel metrics, density and asymmetry, to quantify a paper's semantic neighborhood and assesses their predictive power for citation impact.
Findings
Density metrics modestly improve citation prediction models.
Asymmetry does not significantly enhance prediction accuracy.
Semantic neighborhood density carries informative signals about scientific impact.
Abstract
Scientific behavior is often characterized by a tension between building upon established knowledge and introducing novel ideas. Here, we investigate whether this tension is reflected in the relationship between the similarity of a scientific paper to previous research and its eventual citation rate. To operationalize similarity to previous research, we introduce two complementary metrics to characterize the local geometry of a publication's semantic neighborhood: (1) \emph{density} (), defined as the ratio between a fixed number of previously-published papers and the minimum distance enclosing those papers in a semantic embedding space, and (2) asymmetry (), defined as the average directional difference between a paper and its nearest neighbors. We tested the predictive relationship between these two metrics and its subsequent citation rate using a Bayesian hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Meta-analysis and systematic reviews · Biomedical Text Mining and Ontologies
