Systematic Weight Evaluation for Pruning Large Language Models:   Enhancing Performance and Sustainability

Ashhadul Islam; Samir Brahim Belhaouari; Amine Bermak

arXiv:2502.17071·cs.CL·February 25, 2025

Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability

Ashhadul Islam, Samir Brahim Belhaouari, Amine Bermak

PDF

Open Access

TL;DR

This paper introduces a systematic weight evaluation method for pruning large language models, aiming to improve efficiency and sustainability without sacrificing model performance.

Contribution

It proposes a novel approach that monitors weight importance over training to optimize pruning, balancing model size reduction with performance preservation.

Findings

01

Moderate pruning improves efficiency and reduces loss.

02

Excessive pruning significantly degrades model performance.

03

Monitoring weight evolution enables sustainable model development.

Abstract

The exponential growth of large language models (LLMs) like ChatGPT has revolutionized artificial intelligence, offering unprecedented capabilities in natural language processing. However, the extensive computational resources required for training these models have significant environmental implications, including high carbon emissions, energy consumption, and water usage. This research presents a novel approach to LLM pruning, focusing on the systematic evaluation of individual weight importance throughout the training process. By monitoring parameter evolution over time, we propose a method that effectively reduces model size without compromising performance. Extensive experiments with both a scaled-down LLM and a large multimodal model reveal that moderate pruning enhances efficiency and reduces loss, while excessive pruning drastically deteriorates model performance. These findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsPruning