SoK: Reducing the Vulnerability of Fine-tuned Language Models to   Membership Inference Attacks

Guy Amit; Abigail Goldsteen; Ariel Farkash

arXiv:2403.08481·cs.LG·March 14, 2024·1 cites

SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks

Guy Amit, Abigail Goldsteen, Ariel Farkash

PDF

Open Access

TL;DR

This paper systematically reviews how fine-tuned language models are vulnerable to membership inference attacks and evaluates defense strategies, highlighting that combining differential privacy with low-rank adaptors offers strong privacy protection.

Contribution

It provides the first comprehensive analysis of factors affecting privacy risks and defense effectiveness for fine-tuned language models against membership inference attacks.

Findings

01

Certain training methods reduce privacy risk

02

Differential privacy combined with low-rank adaptors offers best protection

03

Limited research exists on defense strategies in language models

Abstract

Natural language processing models have experienced a significant upsurge in recent years, with numerous applications being built upon them. Many of these applications require fine-tuning generic base models on customized, proprietary datasets. This fine-tuning data is especially likely to contain personal or sensitive information about individuals, resulting in increased privacy risk. Membership inference attacks are the most commonly employed attack to assess the privacy leakage of a machine learning model. However, limited research is available on the factors that affect the vulnerability of language models to this kind of attack, or on the applicability of different defense strategies in the language domain. We provide the first systematic review of the vulnerability of fine-tuned large language models to membership inference attacks, the various factors that come into play, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Access Control and Trust

MethodsBalanced Selection