Optimising for Energy Efficiency and Performance in Machine Learning
Emile Dos Santos Ferreira, Andrei Paleyes, Neil D. Lawrence

TL;DR
This paper introduces ECOpt, a hyperparameter tuning tool that balances energy efficiency and performance in machine learning, providing insights into energy scaling laws and practical benefits for sustainable AI development.
Contribution
The paper presents ECOpt, a novel hyperparameter tuner that optimizes for energy efficiency alongside model performance, addressing measurement gaps and environmental concerns.
Findings
Parameter counts are unreliable proxies for energy consumption.
Energy efficiency of Transformer models is consistent across hardware.
ECOpt can improve CIFAR-10 models considering both accuracy and energy use.
Abstract
The ubiquity of machine learning (ML) and the demand for ever-larger models bring an increase in energy consumption and environmental impact. However, little is known about the energy scaling laws in ML, and existing research focuses on training cost -- ignoring the larger cost of inference. Furthermore, tools for measuring the energy consumption of ML do not provide actionable feedback. To address these gaps, we developed Energy Consumption Optimiser (ECOpt): a hyperparameter tuner that optimises for energy efficiency and model performance. ECOpt quantifies the trade-off between these metrics as an interpretable Pareto frontier. This enables ML practitioners to make informed decisions about energy cost and environmental impact, while maximising the benefit of their models and complying with new regulations. Using ECOpt, we show that parameter and floating-point operation counts can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGreen IT and Sustainability · Big Data and Digital Economy · Machine Learning and Data Classification
