When, Where, and How to Open Data: A Personal Perspective
Benjamin Nachman

TL;DR
This paper discusses the strategic considerations for open data sharing in high energy physics, emphasizing the need for better data preservation, accessibility for non-collaborators, and the emergence of 'data physicists' to enhance scientific outcomes.
Contribution
It advocates for improved data preservation, easier engagement for external researchers, and introduces the concept of 'data physicists' to maximize open data utility in high energy physics.
Findings
Open data can significantly expand high energy physics research.
Enhanced data preservation and accessibility are crucial for scientific progress.
The role of 'data physicists' is vital for analyzing open data effectively.
Abstract
This is a personal perspective on data sharing in the context of public data releases suitable for generic analysis. These open data can be a powerful tool for expanding the science of high energy physics, but care must be taken in when, where, and how they are utilized. I argue that data preservation even within collaborations needs additional support in order to maximize our science potential. It should also be easier for non-collaboration members to engage with collaborations. Finally, I advocate that we recognize a new type of high energy physicist: the 'data physicist', who would be optimally suited to analyze open data as well as develop and deploy new advanced data science tools so that we can use our precious data to their fullest potential. This document has been coordinated with a white paper on open data commissioned by the American Physical Society's (APS)…
Taxonomy
Topics: Big Data Technologies and Applications · Scientific Computing and Data Management · Advanced Data Storage Technologies
