Energy Considerations for Large Pretrained Neural Networks

Leo Mei; Mark Stamp

arXiv:2506.01311·cs.LG·June 3, 2025

Energy Considerations for Large Pretrained Neural Networks

Leo Mei, Mark Stamp

PDF

Open Access

TL;DR

This paper evaluates how different compression techniques for large neural networks impact their electricity consumption, revealing that steganographic capacity reduction significantly reduces energy use compared to pruning and low-rank factorization.

Contribution

It provides a comparative analysis of energy consumption for various compression methods on large pre-trained models, highlighting the effectiveness of steganographic capacity reduction.

Findings

01

Steganographic capacity reduction greatly decreases energy consumption.

02

Pruning and low-rank factorization show minimal energy savings.

03

Compression techniques vary significantly in their impact on electricity use.

Abstract

Increasingly complex neural network architectures have achieved phenomenal performance. However, these complex models require massive computational resources that consume substantial amounts of electricity, which highlights the potential environmental impact of such models. Previous studies have demonstrated that substantial redundancies exist in large pre-trained models. However, previous work has primarily focused on compressing models while retaining comparable model performance, and the direct impact on electricity consumption appears to have received relatively little attention. By quantifying the energy usage associated with both uncompressed and compressed models, we investigate compression as a means of reducing electricity consumption. We consider nine different pre-trained models, ranging in size from 8M parameters to 138M parameters. To establish a baseline, we first train…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications