Driving Computational Efficiency in Large-Scale Platforms using HPC Technologies
Alexander Martinez Mendez, Antonio J. Rubio-Montero, Carlos J. Barrios H., Hern\'an Asorey, Rafael Mayo-Garc\'ia, and Luis A. N\'u\~nez

TL;DR
This paper analyzes HPC resource utilization in the LAGO astroparticle physics project, identifying inefficiencies and providing data-driven recommendations to optimize computational throughput and scientific productivity.
Contribution
It offers a detailed analysis of resource consumption patterns in LAGO's HPC environment and proposes strategies for improving efficiency based on workload characterization.
Findings
High CPU efficiency in simulation tasks
Short test jobs distort aggregate efficiency metrics
Recommendations for optimizing resource requests and workflows
Abstract
The Latin American Giant Observatory (LAGO) project utilizes extensive High-Performance Computing (HPC) resources for complex astroparticle physics simulations, making resource efficiency critical for scientific productivity and sustainability. This article presents a detailed analysis focused on quantifying and improving HPC resource utilization efficiency specifically within the LAGO computational environment. The core objective is to understand how LAGO's distinct computational workloads-characterized by a prevalent coarse-grained, task-parallel execution model-consume resources in practice. To achieve this, we analyze historical job accounting data from the EGI FedCloud platform, identifying primary workload categories (Monte Carlo simulations, data processing, user analysis/testing) and evaluating their performance using key efficiency metrics (CPU utilization, walltime…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques · Scientific Computing and Data Management
