Exploring the relevance of ORCID as a source of study of data sharing activities at the individual-level: a methodological discussion
Andrea Sixto-Costoya, Nicolas Robinson-Garcia, Thed N. van Leeuwen and, Rodrigo Costas

TL;DR
This study explores the potential of ORCID profiles as a valuable source for analyzing individual data sharing practices, focusing on dataset publication patterns across countries, disciplines, and career stages.
Contribution
It demonstrates how ORCID can be used to study data sharing activities, highlighting the importance of DataCite and providing insights into researcher behaviors.
Findings
DataCite is the primary source for dataset information in ORCID.
Researchers in Natural Sciences and Medicine share the most datasets.
Researchers who started their PhD around 2015 tend to publish datasets earlier.
Abstract
ORCID is a scientific infrastructure created to solve the problem of author name ambiguity. Over the years ORCID has also become a useful source for studying academic activities reported by researchers. Our objective in this research was to use ORCID to analyze one of these research activities: the publication of datasets. We illustrate how the identification of datasets that shared in researchers' ORCID profiles enables the study of the characteristics of the researchers who have produced them. To explore the relevance of ORCID to study data sharing practices we obtained all ORCID profiles reporting at least one dataset in their "works" list, together with information related to the individual researchers producing the datasets. The retrieved data was organized and analyzed in a SQL database hosted at CWTS. Our results indicate that DataCite is by far the most important data source for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Research Data Management Practices · Scientific Computing and Data Management
