The Impact of GPU DVFS on the Energy and Performance of Deep Learning:   an Empirical Study

Zhenheng Tang; Yuxin Wang; Qiang Wang; Xiaowen Chu

arXiv:1905.11012·cs.PF·May 28, 2019·5 cites

The Impact of GPU DVFS on the Energy and Performance of Deep Learning: an Empirical Study

Zhenheng Tang, Yuxin Wang, Qiang Wang, Xiaowen Chu

PDF

Open Access

TL;DR

This empirical study investigates how GPU Dynamic Voltage and Frequency Scaling (DVFS) affects energy consumption and performance in deep learning, revealing significant energy savings across various GPU architectures and DNN configurations.

Contribution

The paper provides a comprehensive empirical analysis of GPU DVFS impact on deep learning, highlighting optimal frequency settings for energy efficiency during training and inference.

Findings

01

Optimal core frequency reduces energy consumption by up to 23.1%.

02

Energy savings during inference range from 19.6% to 26.4%.

03

GPU DVFS can significantly enhance energy efficiency in deep learning workflows.

Abstract

Over the past years, great progress has been made in improving the computing power of general-purpose graphics processing units (GPGPUs), which facilitates the prosperity of deep neural networks (DNNs) in multiple fields like computer vision and natural language processing. A typical DNN training process repeatedly updates tens of millions of parameters, which not only requires huge computing resources but also consumes significant energy. In order to train DNNs in a more energy-efficient way, we empirically investigate the impact of GPU Dynamic Voltage and Frequency Scaling (DVFS) on the energy consumption and performance of deep learning. Our experiments cover a wide range of GPU architectures, DVFS settings, and DNN configurations. We observe that, compared to the default core frequency settings of three tested GPUs, the optimal core frequency can help conserve 8.7% $\sim$ 23.1% energy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Memory and Neural Computing · IoT and Edge/Fog Computing