Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade   Offs in Large Language Model Training

Vivian Liu; Yiqiao Yin

arXiv:2404.01157·cs.CL·April 2, 2024·3 cites

Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training

Vivian Liu, Yiqiao Yin

PDF

Open Access

TL;DR

This paper evaluates the carbon footprints of large language models, compares hardware impacts, and proposes strategies for environmentally sustainable AI training without compromising model performance.

Contribution

It provides a comprehensive analysis of CO2 emissions in LLM training and suggests mitigation strategies, highlighting hardware choices and responsible training practices.

Findings

01

Large models have high carbon footprints due to their size.

02

Hardware choice significantly affects CO2 emissions during training.

03

Proposed mitigation strategies can reduce emissions without losing model robustness.

Abstract

Prominent works in the field of Natural Language Processing have long attempted to create new innovative models by improving upon previous model training approaches, altering model architecture, and developing more in-depth datasets to better their performance. However, with the quickly advancing field of NLP comes increased greenhouse gas emissions, posing concerns over the environmental damage caused by training LLMs. Gaining a comprehensive understanding of the various costs, particularly those pertaining to environmental aspects, that are associated with artificial intelligence serves as the foundational basis for ensuring safe AI models. Currently, investigations into the CO2 emissions of AI models remain an emerging area of research, and as such, in this paper, we evaluate the CO2 emissions of well-known large language models, which have an especially high carbon footprint due to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling