Total cost of ownership and evaluation of Google cloud resources for the ATLAS experiment at the LHC
The ATLAS Collaboration

TL;DR
This paper evaluates the integration and cost-effectiveness of Google cloud resources for the ATLAS experiment at the LHC, highlighting cost drivers, resource bursting, and future optimization strategies.
Contribution
It presents the first total cost of ownership analysis for cloud resources in high-energy physics and demonstrates effective integration and cost management techniques.
Findings
Cloud resources effectively supplement ATLAS computing capacity.
Network costs are a major expense impacting workflows.
Resource bursting is feasible but incurs significant costs.
Abstract
The ATLAS Google Project was established as part of an ongoing evaluation of the use of commercial clouds by the ATLAS Collaboration, in anticipation of the potential future adoption of such resources by WLCG grid sites to fulfil or complement their computing pledges. Seamless integration of Google cloud resources into the worldwide ATLAS distributed computing infrastructure was achieved at large scale and for an extended period of time, and hence cloud resources are shown to be an effective mechanism to provide additional, flexible computing capacity to ATLAS. For the first time a total cost of ownership analysis has been performed, to identify the dominant cost drivers and explore effective mechanisms for cost control. Network usage significantly impacts the costs of certain ATLAS workflows, underscoring the importance of implementing such mechanisms. Resource bursting has been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
