Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge
Aida Usmanova, Junbo Huang, Debayan Banerjee, Ricardo Usbeck

TL;DR
This paper assesses the environmental impact of training and fine-tuning large language models, specifically T5, for question-answering tasks, highlighting the importance of balancing performance with sustainability.
Contribution
It introduces a method to quantify and report the environmental impact of LLM training and fine-tuning, emphasizing sustainability considerations in model development.
Findings
Smaller models are not always more sustainable than larger ones.
Increased training does not necessarily lead to better performance.
Optimal results are achieved by balancing efficiency and accuracy.
Abstract
Human-produced emissions are growing at an alarming rate, causing already observable changes in the climate and environment in general. Each year global carbon dioxide emissions hit a new record, and it is reported that 0.5% of total US greenhouse gas emissions are attributed to data centres as of 2021. The release of ChatGPT in late 2022 sparked social interest in Large Language Models (LLMs), the new generation of Language Models with a large number of parameters and trained on massive amounts of data. Currently, numerous companies are releasing products featuring various LLMs, with many more models in development and awaiting release. Deep Learning research is a competitive field, with only models that reach top performance attracting attention and being utilized. Hence, achieving better accuracy and results is often the first priority, while the model's efficiency and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsGated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Dense Connections · Inverse Square Root Schedule · Dropout · Linear Layer · Attention Dropout
