Impact of ML Optimization Tactics on Greener Pre-Trained ML Models

Alexandra Gonz\'alez \'Alvarez; Joel Casta\~no; Xavier Franch,; Silverio Mart\'inez-Fern\'andez

arXiv:2409.12878·cs.LG·September 20, 2024

Impact of ML Optimization Tactics on Greener Pre-Trained ML Models

Alexandra Gonz\'alez \'Alvarez, Joel Casta\~no, Xavier Franch,, Silverio Mart\'inez-Fern\'andez

PDF

Open Access

TL;DR

This paper evaluates how different ML optimization techniques affect the energy efficiency and economic costs of pre-trained image classification models, highlighting effective methods for greener AI deployment.

Contribution

It provides a comprehensive analysis of PyTorch optimization tactics on pre-trained models, offering practical guidelines for reducing energy consumption and costs.

Findings

01

Dynamic quantization reduces inference time and energy significantly.

02

torch.compile balances accuracy with energy efficiency.

03

Global pruning increases costs due to longer optimization times.

Abstract

Background: Given the fast-paced nature of today's technology, which has surpassed human performance in tasks like image classification, visual reasoning, and English understanding, assessing the impact of Machine Learning (ML) on energy consumption is crucial. Traditionally, ML projects have prioritized accuracy over energy, creating a gap in energy consumption during model inference. Aims: This study aims to (i) analyze image classification datasets and pre-trained models, (ii) improve inference efficiency by comparing optimized and non-optimized models, and (iii) assess the economic impact of the optimizations. Method: We conduct a controlled experiment to evaluate the impact of various PyTorch optimization techniques (dynamic quantization, torch.compile, local pruning, and global pruning) to 42 Hugging Face models for image classification. The metrics examined include GPU…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsPruning · ALIGN