When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning

Ruining Sun; Hongsheng Hu; Wei Luo; Zhaoxi Zhang; Yanjun Zhang; Haizhuan Yuan; Leo Yu Zhang

arXiv:2506.05743·cs.CR·June 9, 2025

When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning

Ruining Sun, Hongsheng Hu, Wei Luo, Zhaoxi Zhang, Yanjun Zhang, Haizhuan Yuan, Leo Yu Zhang

PDF

TL;DR

This paper investigates privacy risks in contrastive learning encoder models, revealing that improved feature extraction capabilities increase privacy leakage, and introduces a novel attack method based on feature vector p-norms that outperforms existing techniques.

Contribution

It systematically analyzes privacy leakage in contrastive learning, and proposes the Embedding Lp-Norm Likelihood Attack (LpLA), a new method leveraging feature vector p-norms for membership inference.

Findings

01

Advanced encoder architectures increase privacy leakage.

02

LpLA outperforms existing attacks in effectiveness and robustness.

03

Privacy risks are significant in self-supervised contrastive learning models.

Abstract

With the rapid advancement of deep learning technology, pre-trained encoder models have demonstrated exceptional feature extraction capabilities, playing a pivotal role in the research and application of deep learning. However, their widespread use has raised significant concerns about the risk of training data privacy leakage. This paper systematically investigates the privacy threats posed by membership inference attacks (MIAs) targeting encoder models, focusing on contrastive learning frameworks. Through experimental analysis, we reveal the significant impact of model architecture complexity on membership privacy leakage: As more advanced encoder frameworks improve feature-extraction performance, they simultaneously exacerbate privacy-leakage risks. Furthermore, this paper proposes a novel membership inference attack method based on the p-norm of feature vectors, termed the Embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need · Contrastive Learning