Quantifying the Energy Consumption and Carbon Emissions of LLM Inference via Simulations
Miray \"Ozcan, Philipp Wiesner, Philipp Wei{\ss}, Odej Kao

TL;DR
This paper introduces a simulation framework that accurately estimates the energy consumption and carbon emissions of LLM inference by incorporating power models and grid conditions, enabling better deployment decisions.
Contribution
We extend existing inference simulators with GPU power modeling and integrate them into an energy system co-simulation for comprehensive environmental impact analysis.
Findings
Inference parameters significantly influence energy demand and emissions.
Potential for up to 69.2% renewable offset in specific deployment scenarios.
Framework supports development of carbon-aware LLM inference strategies.
Abstract
The environmental impact of Large Language Models (LLMs) is rising significantly, with inference now accounting for more than half of their total lifecycle carbon emissions. However, existing simulation frameworks, which are increasingly used to determine efficient LLM deployments, lack any concept of power and, therefore, cannot accurately estimate inference-related emissions. We present a simulation framework to assess the energy and carbon implications of LLM inference under varying deployment setups. First, we extend a high-fidelity LLM inference simulator with a GPU power model that estimates power consumption based on utilization metrics, enabling analysis across configurations like batch size, sequence length, and model parallelism. Second, we integrate simulation outputs into an energy system co-simulation environment to quantify carbon emissions under specific grid conditions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear reactor physics and engineering
