Data Preservation in High Energy Physics
Roman Kogler, David M. South, Michael Steder

TL;DR
This paper discusses the importance, challenges, and current strategies for preserving high-energy physics data, emphasizing the need for long-term access to unique experimental datasets amidst evolving technological and organizational landscapes.
Contribution
It provides a comprehensive review of the status, challenges, and recommendations for data preservation in high energy physics, based on the DPHEP study group’s findings.
Findings
Highlighting the critical need for data preservation in HEP.
Identifying technological and organizational challenges.
Proposing models and governance for long-term data access.
Abstract
Data from high-energy physics experiments are collected with significant financial and human effort and are mostly unique. However, until recently no coherent strategy existed for data preservation and re-use, and many important and complex data sets have simply been lost. While the current focus is on the LHC at CERN, in the current period several important and unique experimental programs at other facilities are coming to an end, including those at HERA, b-factories and the Tevatron. To address this issue, an inter-experimental study group on HEP data preservation and long-term analysis (DPHEP) was convened at the end of 2008. The group now aims to publish a full and detailed review of the present status of data preservation in high energy physics. This contribution summarises the results of the DPHEP study group, describing the challenges of data preservation in high energy physics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
