Quantifying and Mitigating Privacy Risks of Contrastive Learning

Xinlei He; Yang Zhang

arXiv:2102.04140·cs.LG·September 22, 2021·6 cites

Quantifying and Mitigating Privacy Risks of Contrastive Learning

Xinlei He, Yang Zhang

PDF

Open Access 1 Repo

TL;DR

This paper analyzes privacy risks in contrastive learning, revealing its vulnerabilities to attribute inference and proposing Talos, an adversarial training method, to mitigate these risks while preserving utility.

Contribution

It is the first to analyze privacy risks of contrastive learning and introduces Talos, a novel adversarial training approach for privacy preservation.

Findings

01

Contrastive models are less vulnerable to membership inference.

02

Contrastive models are more vulnerable to attribute inference.

03

Talos effectively reduces attribute inference risks while maintaining utility.

Abstract

Data is the key factor to drive the development of machine learning (ML) during the past decade. However, high-quality data, in particular labeled data, is often hard and expensive to collect. To leverage large-scale unlabeled data, self-supervised learning, represented by contrastive learning, is introduced. The objective of contrastive learning is to map different views derived from a training sample (e.g., through data augmentation) closer in their representation space, while different views derived from different samples more distant. In this way, a contrastive model learns to generate informative representations for data samples, which are then used to perform downstream ML tasks. Recent research has shown that machine learning models are vulnerable to various privacy attacks. However, most of the current efforts concentrate on models trained with supervised learning. Meanwhile,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xinleihe/contrastiveleaks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning