Impact of ML Optimization Tactics on Greener Pre-Trained ML Models
Alexandra Gonz\'alez \'Alvarez, Joel Casta\~no, Xavier Franch,, Silverio Mart\'inez-Fern\'andez

TL;DR
This paper evaluates how different ML optimization techniques affect the energy efficiency and economic costs of pre-trained image classification models, highlighting effective methods for greener AI deployment.
Contribution
It provides a comprehensive analysis of PyTorch optimization tactics on pre-trained models, offering practical guidelines for reducing energy consumption and costs.
Findings
Dynamic quantization reduces inference time and energy significantly.
torch.compile balances accuracy with energy efficiency.
Global pruning increases costs due to longer optimization times.
Abstract
Background: Given the fast-paced nature of today's technology, which has surpassed human performance in tasks like image classification, visual reasoning, and English understanding, assessing the impact of Machine Learning (ML) on energy consumption is crucial. Traditionally, ML projects have prioritized accuracy over energy, creating a gap in energy consumption during model inference. Aims: This study aims to (i) analyze image classification datasets and pre-trained models, (ii) improve inference efficiency by comparing optimized and non-optimized models, and (iii) assess the economic impact of the optimizations. Method: We conduct a controlled experiment to evaluate the impact of various PyTorch optimization techniques (dynamic quantization, torch.compile, local pruning, and global pruning) to 42 Hugging Face models for image classification. The metrics examined include GPU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsPruning · ALIGN
