Characterizing in-text citations in scientific articles: A large-scale analysis
Kevin W. Boyack, Nees Jan van Eck, Giovanni Colavizza, Ludo Waltman

TL;DR
This large-scale analysis of over five million articles reveals field-specific patterns and temporal trends in in-text citations, including differences in citation position, age, and mention frequency, highlighting diverse citation behaviors across scientific disciplines.
Contribution
The study provides a comprehensive characterization of in-text citation patterns across multiple scientific fields and over time, revealing significant field-level differences and temporal trends.
Findings
Field-specific differences in citation position and age
Increase in references and mentions over time
Single-mention references are more highly cited
Abstract
We report characteristics of in-text citations in over five million full text articles from two large databases - the PubMed Central Open Access subset and Elsevier journals - as functions of time, textual progression, and scientific field. The purpose of this study is to understand the characteristics of in-text citations in a detailed way prior to pursuing other studies focused on answering more substantive research questions. As such, we have analyzed in-text citations in several ways and report many findings here. Perhaps most significantly, we find that there are large field-level differences that are reflected in position within the text, citation interval (or reference age), and citation counts of references. In general, the fields of Biomedical and Health Sciences, Life and Earth Sciences, and Physical Sciences and Engineering have similar reference distributions, although they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Biomedical Text Mining and Ontologies · Advanced Text Analysis Techniques
