TL;DR
This study uncovers hidden social science research communities by analyzing citation networks of datasets, revealing how data reuse shapes disciplinary and interdisciplinary structures.
Contribution
It introduces a network analysis approach to identify and characterize communities of data reuse within a large social science data archive.
Findings
Identified distinct communities of datasets linked to specific research fields.
Found datasets at crossroads connect multiple research communities.
Revealed the organization of scientific communities around shared data use.
Abstract
Data archives are an important source of high quality data in many fields, making them ideal sites to study data reuse. By studying data reuse through citation networks, we are able to learn how hidden research communities - those that use the same scientific datasets - are organized. This paper analyzes the community structure of an authoritative network of datasets cited in academic publications, which have been collected by a large, social science data archive: the Interuniversity Consortium for Political and Social Research (ICPSR). Through network analysis, we identified communities of social science datasets and fields of research connected through shared data use. We argue that communities of exclusive data reuse form subdivisions that contain valuable disciplinary resources, while datasets at a "crossroads" broadly connect research communities. Our research reveals the hidden…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
