PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation
Tao Fan, Guoqiang Ma, Yuanfeng Song, Lixin Fan, Qiang Yang

TL;DR
PPC-GPT introduces a federated framework that combines privacy-preserving data perturbation, synthetic data generation, and task-specific model compression to efficiently create small, domain-specific language models without compromising privacy.
Contribution
It presents a novel unified approach integrating differential privacy, synthetic data, and pruning for federated LLM compression, addressing privacy and resource constraints simultaneously.
Findings
Achieves performance comparable to full LLMs on various tasks.
Ensures robust privacy protection through federated architecture.
Effectively compresses models while maintaining task-specific accuracy.
Abstract
Compressing Large Language Models (LLMs) into task-specific Small Language Models (SLMs) encounters two significant challenges: safeguarding domain-specific knowledge privacy and managing limited resources. To tackle these challenges, we propose PPC-GPT, a novel unified framework that systematically addresses both privacy preservation and model compression in federated settings. PPC-GPT works on a server-client federated architecture, where the client sends differentially private (DP) perturbed task-specific data to the server's LLM. The LLM then generates synthetic data along with their corresponding rationales. This synthetic data is subsequently used for both LLM pruning and retraining processes. Our framework's key innovation lies in its holistic integration of privacy-preserving mechanisms, synthetic data generation, and task-specific compression techniques, creating unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Healthcare
MethodsPruning
