Membership and Memorization in LLM Knowledge Distillation
Ziqi Zhang, Ali Shahin Shamsabadi, Hanxiao Lu, Yifeng Cai, Hamed Haddadi

TL;DR
This paper investigates privacy risks, specifically membership and memorization, in knowledge distillation of large language models, revealing that all existing methods pose privacy threats with varying degrees and characteristics.
Contribution
It systematically characterizes privacy risks in LLM knowledge distillation, analyzing how different techniques and components influence these risks and highlighting variability across models and data blocks.
Findings
All KD techniques pose privacy risks from teacher to student.
Privacy risks vary significantly across different KD methods and components.
Memorization and membership privacy risks can differ greatly across model blocks.
Abstract
Recent advances in Knowledge Distillation (KD) aim to mitigate the high computational demands of Large Language Models (LLMs) by transferring knowledge from a large ''teacher'' to a smaller ''student'' model. However, students may inherit the teacher's privacy when the teacher is trained on private data. In this work, we systematically characterize and investigate membership and memorization privacy risks inherent in six LLM KD techniques. Using instruction-tuning settings that span seven NLP tasks, together with three teacher model families (GPT-2, LLAMA-2, and OPT), and various size student models, we demonstrate that all existing LLM KD approaches carry membership and memorization privacy risks from the teacher to its students. However, the extent of privacy risks varies across different KD techniques. We systematically analyse how key LLM KD components (KD objective functions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
