From Data Processes to Data Products: Knowledge Infrastructures in Astronomy
Christine L. Borgman, Morgan F. Wofford

TL;DR
This paper examines how astronomers develop, maintain, and reuse data products through knowledge infrastructures, emphasizing the role of software pipelines and the challenges in sustaining access to scientific data.
Contribution
It provides a comparative ethnographic analysis of three astronomy groups, highlighting the importance and fragility of software pipelines in astronomical knowledge infrastructures.
Findings
Astronomers encode research methods in software pipelines.
Data products are reprocessed multiple times for new scientific purposes.
Reusing data is fundamental, regardless of public availability.
Abstract
We explore how astronomers take observational data from telescopes, process them into usable scientific data products, curate them for later use, and reuse data for further inquiry. Astronomers have invested heavily in knowledge infrastructures - robust networks of people, artifacts, and institutions that generate, share, and maintain specific knowledge about the human and natural worlds. Drawing upon a decade of interviews and ethnography, this article compares how three astronomy groups capture, process, and archive data, and for whom. The Sloan Digital Sky Survey is a mission with a dedicated telescope and instruments, while the Black Hole Group and Integrative Astronomy Group (both pseudonyms) are university-based, investigator-led collaborations. Findings are organized into four themes: how these projects develop and maintain their workflows; how they capture and archive their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
