LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Ahmad Faiz; Sotaro Kaneda; Ruhan Wang; Rita Osi; Prateek Sharma; Fan Chen; Lei Jiang

arXiv:2309.14393 · cs.CL · January 22, 2024 · 33 cites

TL;DR

This paper introduces LLMCarbon, a comprehensive model for accurately estimating the carbon footprint of large language models, including dense and MoE architectures, before training begins.

Contribution

It presents LLMCarbon, a novel end-to-end model that overcomes limitations of existing tools like mlco2, enabling precise carbon footprint predictions for various LLM architectures.

Findings

01

LLMCarbon improves estimation accuracy over mlco2.

02

It models both dense and MoE LLMs.

03

The tool accounts for architectural parameters and embodied carbon.

Abstract

The carbon footprint associated with large language models (LLMs) is a significant concern, encompassing emissions from their training, inference, experimentation, and storage processes, including operational and embodied carbon emissions. An essential aspect is accurately estimating the carbon impact of emerging LLMs even before their training, which heavily relies on GPU usage. Existing studies have reported the carbon footprint of LLM training, but only one tool, mlco2, can predict the carbon footprint of new neural networks prior to physical training. However, mlco2 has several serious limitations. It cannot extend its estimation to dense or mixture-of-experts (MoE) LLMs, disregards critical architectural parameters, focuses solely on GPUs, and cannot model embodied carbon footprints. Addressing these gaps, we introduce LLMCarbon, an end-to-end carbon footprint projection model…

Code & Models

Repositories

sotarokaneda/mlcarbon (official)

Videos

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models · SlidesLive

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques