Integration of storage endpoints into a Rucio data lake, as an activity to prototype a SKA Regional Centres Network
Manuel Parra-Roy\'on, Jes\'us S\'anchez-Casta\~neda, Juli\'an, Garrido, Susana S\'anchez-Exp\'osito, Rohini Joshi, James Collinson, and Rob Barnsley, Jes\'us Salgado, Lourdes Verdes-Montenegro

TL;DR
This paper describes the deployment of new storage endpoints in a Rucio data lake prototype for the SKA Regional Centres Network, addressing data management challenges for the SKA telescopes.
Contribution
It presents the process and instructions for integrating a Rucio Storage Element based on StoRM and WebDAV into the SKA data lake prototype.
Findings
Successful deployment of new storage endpoints within the Rucio data lake
Enhanced data management capabilities for SKA Regional Centres
Guidelines for deploying Rucio Storage Elements using StoRM and WebDAV
Abstract
The Square Kilometre Array (SKA) infrastructure will consist of two radio telescopes that will be the most sensitive telescopes on Earth. The SKA community will have to process and manage near exascale data, which will be a technical challenge for the coming years. In this respect, the SKA Global Network of Regional Centres plays a key role in data distribution and management. The SRCNet will provide distributed computing and data storage capacity, as well as other important services for the network. Within the SRCNet, several teams have been set up for the research, design and development of 5 prototypes. One of these prototypes is related to data management and distribution, where a data lake has been deployed using Rucio. In this paper we focus on the tasks performed by several of the teams to deploy new storage endpoints within the SKAO data lake. In particular, we will describe the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Communication Systems · Radio Astronomy Observations and Technology · Distributed and Parallel Computing Systems
