How Green are Neural Language Models? Analyzing Energy Consumption in   Text Summarization Fine-tuning

Tohida Rehman; Debarshi Kumar Sanyal; Samiran Chattopadhyay

arXiv:2501.15398·cs.CL·March 17, 2025

How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning

Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay

PDF

Open Access

TL;DR

This paper evaluates the environmental impact of fine-tuning neural language models for text summarization, highlighting the significant carbon footprint of larger models and advocating for energy-efficient AI practices.

Contribution

It provides a comparative analysis of energy consumption and performance trade-offs among three neural language models during fine-tuning for summarization tasks.

Findings

01

LLaMA-3-8B has the highest carbon footprint among the models.

02

Performance metrics vary across models, with larger models generally performing better.

03

Environmental impact should be considered in neural language model development.

Abstract

Artificial intelligence systems significantly impact the environment, particularly in natural language processing (NLP) tasks. These tasks often require extensive computational resources to train deep neural networks, including large-scale language models containing billions of parameters. This study analyzes the trade-offs between energy consumption and performance across three neural language models: two pre-trained models (T5-base and BART-base), and one large language model (LLaMA-3-8B). These models were fine-tuned for the text summarization task, focusing on generating research paper highlights that encapsulate the core themes of each paper. The carbon footprint associated with fine-tuning each model was measured, offering a comprehensive assessment of their environmental impact. It is observed that LLaMA-3-8B produces the largest carbon footprint among the three models. A wide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques