Machine Learning Algorithms for Active Monitoring of High Performance Computing as a Service (HPCaaS) Cloud Environments
Gianluca Longoni (1), Ryan LaMothe (1), Jeremy Teuton (1), Mark, Greaves (1), Nicole Nichols (1), William Smith (1) ((1) Pacific Northwest, National Laboratory)

TL;DR
This paper investigates the use of machine learning to identify engineering applications running on cloud-based HPC environments, utilizing privacy-preserving billing data to classify different computational workloads.
Contribution
It introduces a method for classifying HPC applications on cloud infrastructure using privacy-preserving billing features and demonstrates its effectiveness with real-world engineering codes.
Findings
Successful classification of different HPC applications using billing data
Demonstrated privacy-preserving application identification in cloud HPC environments
Applicable to various cloud providers and HPC workloads
Abstract
Cloud computing provides ubiquitous and on-demand access to vast reconfigurable resources that can meet any computational need. Many service models are available, but the Infrastructure as a Service (IaaS) model is particularly suited to operate as a high performance computing (HPC) platform, by networking large numbers of cloud computing nodes. We used the Pacific Northwest National Laboratory (PNNL) cloud computing environment to perform our experiments. A number of cloud computing providers such as Amazon Web Services, Microsoft Azure, or IBM Cloud, offer flexible and scalable computing resources. This paper explores the viability identifying types of engineering applications running on a cloud infrastructure configured as an HPC platform using privacy preserving features as input to statistical models. The engineering applications considered in this work include MCNP6, a radiation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
