# Evaluation of DVFS techniques on modern HPC processors and accelerators   for energy-aware applications

**Authors:** Enrico Calore, Alessandro Gabbana, Sebastiano Fabio Schifano, Raffaele, Tripiccione

arXiv: 1703.02788 · 2017-03-09

## TL;DR

This paper evaluates DVFS techniques on modern HPC processors and accelerators, analyzing energy and performance trade-offs for energy-efficient computing in large-scale HPC environments.

## Contribution

It provides a comprehensive assessment of DVFS-assisted energy tuning on NVIDIA K80 GPU and Intel Haswell CPU, with practical strategies for energy savings.

## Key findings

- Significant energy savings achieved with function-by-function frequency tuning.
- Trade-offs between energy-to-solution and time-to-solution identified.
- Energy-performance model guides effective DVFS strategies.

## Abstract

Energy efficiency is becoming increasingly important for computing systems, in particular for large scale HPC facilities. In this work we evaluate, from an user perspective, the use of Dynamic Voltage and Frequency Scaling (DVFS) techniques, assisted by the power and energy monitoring capabilities of modern processors in order to tune applications for energy efficiency. We run selected kernels and a full HPC application on two high-end processors widely used in the HPC context, namely an NVIDIA K80 GPU and an Intel Haswell CPU. We evaluate the available trade-offs between energy-to-solution and time-to-solution, attempting a function-by-function frequency tuning. We finally estimate the benefits obtainable running the full code on a HPC multi-GPU node, with respect to default clock frequency governors. We instrument our code to accurately monitor power consumption and execution time without the need of any additional hardware, and we enable it to change CPUs and GPUs clock frequencies while running. We analyze our results on the different architectures using a simple energy-performance model, and derive a number of energy saving strategies which can be easily adopted on recent high-end HPC systems for generic applications.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.02788/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1703.02788/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1703.02788/full.md

---
Source: https://tomesphere.com/paper/1703.02788