Open Science Data Federation -- operation and monitoring
Fabio Andrijauskas, Derek Weitzel, Frank Wuerthwein

TL;DR
The paper presents the Open Science Data Federation (OSDF), a global data access network expanding upon StashCache to enhance data sharing, monitoring, and accounting for scientific research infrastructure.
Contribution
It introduces new features like additional data origins, caches, access methods, and monitoring tools, integrating OSDF into the U.S. cyberinfrastructure landscape.
Findings
OSDF is actively used by multiple research collaborations.
It supports efficient data sharing across diverse scientific projects.
OSDF has become a key component of national cyberinfrastructure.
Abstract
Extensive data processing is becoming commonplace in many fields of science. Distributing data to processing sites and providing methods to share the data with collaborators efficiently has become essential. The Open Science Data Federation (OSDF) builds upon the successful StashCache project to create a global data access network. The OSDF expands the StashCache project to add new data origins and caches, access methods, monitoring, and accounting mechanisms. Additionally, the OSDF has become an integral part of the U.S. national cyberinfrastructure landscape due to the sharing requirements of recent NSF solicitations, which the OSDF is uniquely positioned to enable. The OSDF continues to be utilized by many research collaborations and individual users, which pull the data to many research infrastructures and projects.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
