Strategies to Measure Energy Consumption Using RAPL During Workflow Execution on Commodity Clusters
Philipp Thamm, Ulf Leser

TL;DR
This paper explores three methods to measure energy consumption during scientific workflows on Kubernetes clusters using Intel RAPL, evaluating their accuracy, ease of implementation, and challenges, to promote energy-efficient computing.
Contribution
It introduces and compares three practical methods for RAPL-based energy measurement in Kubernetes environments, highlighting their advantages, limitations, and implementation considerations.
Findings
Shell script and Nextflow plugin methods are effective and easy to implement.
Single-task energy measurement is straightforward, but concurrent tasks require approximation.
Using CPU utilization metrics helps estimate energy for concurrent tasks.
Abstract
In science, problems in many fields can be solved by processing datasets using a series of computationally expensive algorithms, sometimes referred to as workflows. Traditionally, the configurations of these workflows are optimized to achieve a short runtime for the given task and dataset on a given (often distributed) infrastructure. However, recently more attention has been drawn to energy-efficient computing, due to the negative impact of energy-inefficient computing on the environment and energy costs. To be able to assess the energy-efficiency of a given workflow configuration, reliable and accurate methods to measure the energy consumption of a system are required. One approach is the usage of built-in hardware energy counters, such as Intel RAPL. Unfortunately, effectively using RAPL for energy measurement within a workflow on a managed cluster with the typical deep software…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Cloud Computing and Resource Management · Machine Learning in Materials Science
