On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning

Simon Kurz; Jian-Jia Chen; Lucie Flek; Zhixue Zhao

arXiv:2408.14398·cs.CL·November 18, 2025

On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning

Simon Kurz, Jian-Jia Chen, Lucie Flek, Zhixue Zhao

PDF

Open Access

TL;DR

This paper investigates how language-specific calibration impacts the effectiveness of pruning multilingual large language models, revealing limitations in preserving nuanced, language-agnostic knowledge crucial for diverse tasks.

Contribution

It provides the first comprehensive empirical analysis of calibration language effects in multilingual LLM pruning, highlighting limitations in current approaches.

Findings

01

Calibration on target language improves perplexity but not downstream tasks.

02

Pruning preserves language-specific features but loses nuanced, language-agnostic knowledge.

03

Current pruning methods have broader limitations in retaining essential internal representations.

Abstract

Recent advances in large language model (LLM) pruning have shown state-of-the-art (SotA) compression results in post-training and retraining-free settings while maintaining high predictive performance. However, previous research mainly considered calibrating based on English text, despite the multilingual nature of modern LLMs and their frequent use in non-English languages. This analysis paper conducts an in-depth investigation of the performance and internal representation changes associated with pruning multilingual language models for monolingual applications. We present the first comprehensive empirical study, comparing different calibration languages for pruning multilingual models across diverse languages, tasks, models, and SotA pruning techniques. We further analyze the latent subspaces, pruning masks, and individual neurons within pruned models. Our results reveal that while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsSparse Evolutionary Training · Pruning