Preserving Privacy in Large Language Models: A Survey on Current Threats   and Solutions

Michele Miranda; Elena Sofia Ruzzetti; Andrea Santilli; Fabio Massimo; Zanzotto; S\'ebastien Brati\`eres; Emanuele Rodol\`a

arXiv:2408.05212·cs.CR·February 11, 2025·3 cites

Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions

Michele Miranda, Elena Sofia Ruzzetti, Andrea Santilli, Fabio Massimo, Zanzotto, S\'ebastien Brati\`eres, Emanuele Rodol\`a

PDF

Open Access 1 Repo

TL;DR

This survey reviews privacy threats in Large Language Models, analyzing attacks and solutions like differential privacy and unlearning to enhance security and trustworthiness in AI systems.

Contribution

It provides a comprehensive overview of privacy threats and evaluates current mitigation techniques for protecting sensitive information in LLMs.

Findings

01

Privacy attacks on LLMs are prevalent and varied.

02

Differential privacy and unlearning are promising solutions.

03

Challenges remain in balancing privacy and model utility.

Abstract

Large Language Models (LLMs) represent a significant advancement in artificial intelligence, finding applications across various domains. However, their reliance on massive internet-sourced datasets for training brings notable privacy issues, which are exacerbated in critical domains (e.g., healthcare). Moreover, certain application-specific scenarios may require fine-tuning these models on private data. This survey critically examines the privacy threats associated with LLMs, emphasizing the potential for these models to memorize and inadvertently reveal sensitive information. We explore current threats by reviewing privacy attacks on LLMs and propose comprehensive solutions for integrating privacy mechanisms throughout the entire learning pipeline. These solutions range from anonymizing training datasets to implementing differential privacy during training or inference and machine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

michele17284/awesome-privacy-preserving-llms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data