Energy Scaling Laws for Diffusion Models: Quantifying Compute in Image Generation
Aniketh Iyengar, Jiaqi Han, Boris Ruf, Vincent Grari, Marcin Detyniecki, Stefano Ermon

TL;DR
This paper introduces an energy scaling law based on computational complexity to predict GPU energy consumption of diffusion models, aiding sustainable AI deployment.
Contribution
It adapts Kaplan scaling laws to diffusion models, providing a principled, accurate method to estimate energy use across various configurations and hardware.
Findings
High predictive accuracy within architectures ($R^2 > 0.9$).
Strong cross-architecture generalization of energy estimates.
Validation across multiple models, hardware, and inference settings.
Abstract
The rapidly growing computational demands of diffusion models for image generation have raised significant concerns about energy consumption and environmental impact. While existing approaches to energy optimization focus on architectural improvements or hardware acceleration, there is a lack of principled methods to predict energy consumption across different model configurations and hardware setups. We propose an adaptation of Kaplan scaling laws to predict GPU energy consumption for diffusion models based on computational complexity (FLOPs). Our approach decomposes diffusion model inference into text encoding, iterative denoising, and decoding components, with the hypothesis that denoising operations dominate energy consumption due to their repeated execution across multiple inference steps. We conduct comprehensive experiments across four state-of-the-art diffusion models (Stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
