How are research data referenced? The use case of the research data repository RADAR

Dorothea Strecker; Kerstin Soltau; Felix Bach

arXiv:2505.08533·cs.DL·May 14, 2025

How are research data referenced? The use case of the research data repository RADAR

Dorothea Strecker, Kerstin Soltau, Felix Bach

PDF

TL;DR

This study analyzes how research datasets from the RADAR repository are referenced, revealing limited citation practices, the role of data availability statements, and patterns of data reuse and authorship overlap.

Contribution

It provides empirical insights into the current state of data referencing in RADAR, highlighting gaps and potential areas for improving data citation practices.

Findings

01

27.9% of datasets were referenced at least once

02

21.4% of references are formal data citations

03

Most references occur in data availability statements

Abstract

Publishing research data aims to improve the transparency of research results and facilitate the reuse of datasets. In both cases, referencing the datasets that were used is recommended. Research data repositories can support data referencing through various measures and also benefit from it, for example using this information to demonstrate their impact. However, the literature shows that the practice of formally citing research data is not widespread, data metrics are not yet established, and effective incentive structures are lacking. This article examines how often and in what form datasets published via the research data repository RADAR are referenced. For this purpose, the data sources Google Scholar, DataCite Event Data and the Data Citation Corpus were analyzed. The analysis shows that 27.9 % of the datasets in the repository were referenced at least once. 21.4 % of these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.