TL;DR
This study reveals that weight pruning in large language models can significantly amplify biases, despite maintaining language perplexity, raising concerns for deploying fair AI on resource-limited edge devices.
Contribution
It provides a comprehensive empirical analysis of how different pruning methods affect bias and language capabilities in instruction-tuned LLMs, highlighting risks for edge AI deployment.
Findings
Activation-aware pruning preserves perplexity but increases bias amplification.
Random pruning destroys language capability but results in random bias.
Pruning transition rates to biased states are higher than quantization.
Abstract
Weight pruning is widely advocated for deploying Large Language Models on resource-constrained IoT and edge devices, yet its impact on model fairness remains poorly understood. We conduct a controlled empirical study of three instruction-tuned models (Gemma-2-9b-it, Mistral-7B-Instruct-v0.3, Phi-3.5-mini-instruct) across three pruning methods (Random, Magnitude, Wanda) at four sparsity levels (10-70%) on 12,148 BBQ bias benchmark items with 5 random seeds, totaling 2,368,860 inference records. Our results reveal a Smart Pruning Paradox: activation-aware pruning (Wanda) preserves perplexity nearly perfectly (just 3.5% increase at 50% sparsity for Mistral-7B), yet produces the highest bias amplification, with Stereotype Reliance Score increasing 83.7% and 47-59% of previously unbiased items developing new stereotypical behaviors at 70% sparsity. Random pruning destroys language capability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗plawanrath/phi-3.5-mini-instruct-random-s10-piamodel· 28 dl28 dl
- 🤗plawanrath/phi-3.5-mini-instruct-magnitude-s10-piamodel· 24 dl24 dl
- 🤗plawanrath/phi-3.5-mini-instruct-wanda-s10-piamodel· 25 dl25 dl
- 🤗plawanrath/phi-3.5-mini-instruct-random-s30-piamodel· 28 dl28 dl
- 🤗plawanrath/phi-3.5-mini-instruct-magnitude-s30-piamodel· 29 dl· ♡ 129 dl♡ 1
- 🤗plawanrath/phi-3.5-mini-instruct-wanda-s30-piamodel· 25 dl25 dl
- 🤗plawanrath/phi-3.5-mini-instruct-random-s50-piamodel· 27 dl27 dl
- 🤗plawanrath/phi-3.5-mini-instruct-magnitude-s50-piamodel· 28 dl· ♡ 128 dl♡ 1
- 🤗plawanrath/phi-3.5-mini-instruct-wanda-s50-piamodel· 20 dl20 dl
- 🤗plawanrath/phi-3.5-mini-instruct-random-s70-piamodel· 27 dl27 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
