Hugging Carbon: Quantifying the Training Carbon Emissions of AI Models at Scale
Xinlei Wang, Ruibo Ming, Jing Qiu, Junhua Zhao, Jinjin Gu

TL;DR
This paper introduces a scalable FLOPs-based framework for estimating the carbon emissions of large-scale AI model training, using Hugging Face models as a case study to assess environmental impact.
Contribution
It presents a novel tiered approach for carbon accounting of open-source models and introduces the AI training carbon intensity metric for sustainability assessment.
Findings
Training popular open-source models emitted approximately 58,000 metric tons of CO2.
The framework enables aggregate emission estimates without reproducing original models.
Empirical regressions support the statistical significance of the methodology.
Abstract
The scaling-law era has transformed artificial intelligence from research into a global industry, but its rapid growth raises concerns over energy usage, carbon emissions, and environmental sustainability. Unlike traditional sectors, the AI industry still lacks systematic carbon accounting methods that support large-scale estimates without reproducing the original model. This leaves open questions about how large the problem is today and how large it might be in the near future. Given that the Hugging Face (HF) platform well represents the broader open-source community, we treat it as a large-scale, publicly accessible, and audit-ready corpus for carbon accounting. We propose a FLOPs-based framework to estimate aggregate training emissions of HF open-source models. Considering their uneven disclosure quality, we introduce a tiered approach to handle incomplete metadata, supported by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
