Weight Pruning Amplifies Bias: A Multi-Method Study of Compressed LLMs for Edge AI

Plawan Kumar Rath; Rahul Maliakkal

arXiv:2605.08137 · cs.LG · May 12, 2026

TL;DR

This study finds that weight pruning can substantially amplify bias in large language models even when perplexity is preserved, which raises concerns about deploying fair AI on resource-limited edge devices.

Contribution

The paper provides a controlled empirical analysis of how three pruning methods (Random, Magnitude, Wanda) affect bias and language capability in instruction-tuned LLMs, highlighting risks for edge AI deployment.

Findings

01

Activation-aware pruning (Wanda) preserves perplexity but produces the highest bias amplification.

02

Random pruning destroys language capability but results in random bias.

03

Pruning pushes items into biased states at higher transition rates than quantization does.
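The transition rate in the third finding can be made concrete with a small sketch. The listing does not give the paper's exact definition, so the version below is an assumption: it takes the rate to be the share of items that were unbiased under the uncompressed model and became biased after compression. The function name `transition_rate` and the boolean per-item labels are hypothetical.

```python
import numpy as np

def transition_rate(baseline_biased, compressed_biased):
    """Share of baseline-unbiased items that are biased after compression.

    Assumed definition, not taken from the paper: both arguments are
    per-item boolean labels aligned on the same benchmark items.
    """
    base = np.asarray(baseline_biased, dtype=bool)
    comp = np.asarray(compressed_biased, dtype=bool)
    if base.shape != comp.shape:
        raise ValueError("label arrays must have the same shape")

    unbiased_at_baseline = ~base
    n_unbiased = int(unbiased_at_baseline.sum())
    if n_unbiased == 0:
        return float("nan")  # rate is undefined with no unbiased items

    # Items that flipped from unbiased to biased.
    newly_biased = unbiased_at_baseline & comp
    return float(newly_biased.sum() / n_unbiased)

# Four items start unbiased; two of them flip after compression.
baseline = [False, False, False, False, True]
compressed = [True, True, False, False, False]
print(transition_rate(baseline, compressed))
```

Computing the same quantity for a pruned model and a quantized model on the same items would give the two rates that the finding compares.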

Abstract

Weight pruning is widely advocated for deploying Large Language Models on resource-constrained IoT and edge devices, yet its impact on model fairness remains poorly understood. We conduct a controlled empirical study of three instruction-tuned models (Gemma-2-9b-it, Mistral-7B-Instruct-v0.3, Phi-3.5-mini-instruct) across three pruning methods (Random, Magnitude, Wanda) at four sparsity levels (10-70%) on 12,148 BBQ bias benchmark items with 5 random seeds, totaling 2,368,860 inference records. Our results reveal a Smart Pruning Paradox: activation-aware pruning (Wanda) preserves perplexity nearly perfectly (just 3.5% increase at 50% sparsity for Mistral-7B), yet produces the highest bias amplification, with Stereotype Reliance Score increasing 83.7% and 47-59% of previously unbiased items developing new stereotypical behaviors at 70% sparsity. Random pruning destroys language capability…
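To illustrate how the three pruning criteria named in the abstract differ, here is a minimal NumPy sketch, not the authors' implementation. It assumes unstructured pruning applied row by row to a single weight matrix of shape (out_features, in_features). Random pruning scores weights with random numbers, magnitude pruning with |W|, and Wanda with |W| multiplied by the L2 norm of the matching input activation, which is the score from the original Wanda method. The function name `prune_mask`, its parameters, and the per-row grouping for the random and magnitude variants are assumptions made for illustration.

```python
import numpy as np

def prune_mask(weight, sparsity, method, act_norm=None, rng=None):
    """Return a boolean keep-mask for a (out_features, in_features) matrix.

    Hypothetical helper: drops the `sparsity` fraction of lowest-scoring
    weights in each output row.
    """
    w = np.asarray(weight, dtype=float)
    n_in = w.shape[1]
    k = int(round(sparsity * n_in))  # weights dropped per row

    if method == "random":
        rng = np.random.default_rng() if rng is None else rng
        score = rng.random(w.shape)
    elif method == "magnitude":
        score = np.abs(w)
    elif method == "wanda":
        if act_norm is None:
            raise ValueError("wanda needs per-input activation norms")
        # Score = |W_ij| * ||X_j||_2, broadcast across output rows.
        score = np.abs(w) * np.asarray(act_norm, dtype=float)[None, :]
    else:
        raise ValueError(f"unknown method: {method}")

    mask = np.ones(w.shape, dtype=bool)
    if k > 0:
        drop = np.argsort(score, axis=1)[:, :k]  # lowest scores per row
        np.put_along_axis(mask, drop, False, axis=1)
    return mask

W = np.array([[0.1, -2.0, 0.5, 1.0]])
norms = np.array([100.0, 0.01, 1.0, 1.0])  # made-up activation norms

print(prune_mask(W, 0.5, "magnitude"))              # keeps -2.0 and 1.0
print(prune_mask(W, 0.5, "wanda", act_norm=norms))  # keeps 0.1 and 1.0
```

In this toy row, the small weight 0.1 survives under Wanda because its input has a large activation norm, while the large weight -2.0 is dropped because its input is almost never active. Which weights survive therefore depends on the calibration activations, which is the property that separates activation-aware pruning from the other two methods.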
