Quantifying and Mitigating Privacy Risks of Contrastive Learning
Xinlei He, Yang Zhang

TL;DR
This paper analyzes privacy risks in contrastive learning, revealing its vulnerabilities to attribute inference and proposing Talos, an adversarial training method, to mitigate these risks while preserving utility.
Contribution
It is the first to analyze privacy risks of contrastive learning and introduces Talos, a novel adversarial training approach for privacy preservation.
Findings
Contrastive models are less vulnerable to membership inference.
Contrastive models are more vulnerable to attribute inference.
Talos effectively reduces attribute inference risks while maintaining utility.
Abstract
Data is the key factor to drive the development of machine learning (ML) during the past decade. However, high-quality data, in particular labeled data, is often hard and expensive to collect. To leverage large-scale unlabeled data, self-supervised learning, represented by contrastive learning, is introduced. The objective of contrastive learning is to map different views derived from a training sample (e.g., through data augmentation) closer in their representation space, while different views derived from different samples more distant. In this way, a contrastive model learns to generate informative representations for data samples, which are then used to perform downstream ML tasks. Recent research has shown that machine learning models are vulnerable to various privacy attacks. However, most of the current efforts concentrate on models trained with supervised learning. Meanwhile,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
