NeuralPower: Predict and Deploy Energy-Efficient Convolutional Neural Networks
Ermao Cai, Da-Cheng Juan, Dimitrios Stamoulis, Diana Marculescu

TL;DR
NeuralPower is a predictive framework that accurately estimates the energy consumption, runtime, and power of CNNs on GPUs, aiding in designing energy-efficient models before training.
Contribution
It introduces NeuralPower, a layer-wise prediction model using sparse polynomial regression for energy estimation of CNNs on any GPU platform, and proposes the EPR metric for energy-efficient architecture selection.
Findings
Achieves up to 68.5% improvement in prediction accuracy over previous models.
Predicts network-level energy, runtime, and power with over 88% accuracy.
Validated across multiple GPU platforms and deep learning tools.
Abstract
"How much energy is consumed for an inference made by a convolutional neural network (CNN)?" With the increased popularity of CNNs deployed on the wide-spectrum of platforms (from mobile devices to workstations), the answer to this question has drawn significant attention. From lengthening battery life of mobile devices to reducing the energy bill of a datacenter, it is important to understand the energy efficiency of CNNs during serving for making an inference, before actually training the model. In this work, we propose NeuralPower: a layer-wise predictive framework based on sparse polynomial regression, for predicting the serving energy consumption of a CNN deployed on any GPU platform. Given the architecture of a CNN, NeuralPower provides an accurate prediction and breakdown for power and runtime across all layers in the whole network, helping machine learners quickly identify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Adversarial Robustness in Machine Learning
