Linking Data Citation to Repository Visibility: An Empirical Study
Fakhri Momeni, Janete Saldanha Bach, Brigitte Mathiak, Peter Mutschke

TL;DR
This empirical study examines how repository visibility influences dataset citation rates. It finds that higher web domain visibility correlates with more citations, but that other factors also significantly affect citation impact.
Contribution
It provides new insights into the relationship between repository visibility and dataset citation rates using impact indicators and web metrics across social sciences and economics datasets.
Findings
Higher domain visibility correlates with more dataset citations.
Citation impact varies significantly across datasets and domains.
Visibility is a factor but not the sole determinant of citation counts.
Abstract
In today's data-driven research landscape, dataset visibility and accessibility play a crucial role in advancing scientific knowledge. At the same time, data citation is essential for maintaining academic integrity, acknowledging contributions, validating research outcomes, and fostering scientific reproducibility. As a critical link, it connects scholarly publications with the datasets that drive scientific progress. This study investigates whether repository visibility influences data citation rates. We hypothesize that repositories with higher visibility, as measured by search engine metrics, are associated with increased dataset citations. Using OpenAlex data and repository impact indicators (including the visibility index from Sistrix, the h-index of repositories, and citation metrics such as mean and median citations), we analyze datasets in Social Sciences and Economics to…
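The repository impact indicators named in the abstract (h-index, mean citations, median citations) can be illustrated with a minimal Python sketch. It assumes the standard h-index definition applied to a repository's datasets: the largest h such that at least h datasets have at least h citations each. The summary does not state the authors' exact procedure, so the function names and the citation counts below are hypothetical and for illustration only.

```python
from statistics import mean, median


def h_index(citations):
    """Largest h such that at least h items have >= h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def repository_indicators(citations):
    """Summarise one repository's per-dataset citation counts (non-empty list)."""
    return {
        "h_index": h_index(citations),
        "mean": mean(citations),
        "median": median(citations),
    }


# Hypothetical citation counts for the datasets of one repository (not from the paper).
example_counts = [10, 8, 5, 4, 3, 0, 0]
indicators = repository_indicators(example_counts)
print(indicators)
```

Mean and median are reported side by side because citation counts are typically skewed: a few heavily cited datasets can raise the mean while the median stays low.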
Taxonomy
Topics: Research Data Management Practices · scientometrics and bibliometrics research · Scientific Computing and Data Management
