Pruning as a Defense: Reducing Memorization in Large Language Models

Mansi Gupta; Nikhar Waghela; Sarthak Gupta; Shourya Goel; Sanjif; Shanmugavelu

arXiv:2502.15796·cs.LG·February 25, 2025

Pruning as a Defense: Reducing Memorization in Large Language Models

Mansi Gupta, Nikhar Waghela, Sarthak Gupta, Shourya Goel, Sanjif, Shanmugavelu

PDF

Open Access

TL;DR

This paper explores how simple pruning techniques can significantly reduce memorization in large language models, thereby enhancing privacy and security by mitigating membership inference risks.

Contribution

It introduces pruning as an effective method to decrease memorization in LLMs, a novel approach for privacy preservation in large-scale models.

Findings

01

Pruning reduces memorization in large language models.

02

Pruning diminishes susceptibility to membership inference attacks.

03

Pruning maintains model performance while improving privacy.

Abstract

Large language models have been shown to memorize significant portions of their training data, which they can reproduce when appropriately prompted. This work investigates the impact of simple pruning techniques on this behavior. Our findings reveal that pruning effectively reduces the extent of memorization in LLMs, demonstrating its potential as a foundational approach for mitigating membership inference attacks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsPruning