TL;DR
This large-scale study analyzes cross-lingual citations in over one million English papers, revealing trends, usage patterns, and impacts across disciplines and time, with a focus on Chinese and local non-English citations.
Contribution
It provides the first comprehensive analysis of cross-lingual citations across multiple disciplines and decades, and makes data and code publicly available for future research.
Findings
Increasing citations to Chinese publications.
Most cross-lingual citations are to local non-English languages.
Citation intent is consistent between cross- and monolingual citations.
Abstract
Citation information in scholarly data is an important source of insight into the reception of publications and the scholarly discourse. Outcomes of citation analyses and the applicability of citation based machine learning approaches heavily depend on the completeness of such data. One particular shortcoming of scholarly data nowadays is that non-English publications are often not included in data sets, or that language metadata is not available. Because of this, citations between publications of differing languages (cross-lingual citations) have only been studied to a very limited degree. In this paper, we present an analysis of cross-lingual citations based on over one million English papers, spanning three scientific disciplines and a time span of three decades. Our investigation covers differences between cited languages and disciplines, trends over time, and the usage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
