HyperPower: Power- and Memory-Constrained Hyper-Parameter Optimization   for Neural Networks

Dimitrios Stamoulis; Ermao Cai; Da-Cheng Juan; Diana Marculescu

arXiv:1712.02446·cs.LG·December 8, 2017

HyperPower: Power- and Memory-Constrained Hyper-Parameter Optimization for Neural Networks

Dimitrios Stamoulis, Ermao Cai, Da-Cheng Juan, Diana Marculescu

PDF

TL;DR

HyperPower introduces a novel framework for efficient hyper-parameter optimization of neural networks under power and memory constraints, utilizing predictive models and Bayesian methods to significantly accelerate the process and improve accuracy.

Contribution

It is the first to use power as a known constraint and develop predictive models for power and memory of neural networks on GPUs, enhancing optimization efficiency.

Findings

01

Achieves up to 112.99x faster hyper-parameter tuning

02

Reaches up to 30.12x better test error with constraints

03

Speeds up evaluations by up to 57.20x

Abstract

While selecting the hyper-parameters of Neural Networks (NNs) has been so far treated as an art, the emergence of more complex, deeper architectures poses increasingly more challenges to designers and Machine Learning (ML) practitioners, especially when power and memory constraints need to be considered. In this work, we propose HyperPower, a framework that enables efficient Bayesian optimization and random search in the context of power- and memory-constrained hyper-parameter optimization for NNs running on a given hardware platform. HyperPower is the first work (i) to show that power consumption can be used as a low-cost, a priori known constraint, and (ii) to propose predictive models for the power and memory of NNs executing on GPUs. Thanks to HyperPower, the number of function evaluations and the best test error achieved by a constraint-unaware method are reached up to 112.99x and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRandom Search