Differentially Private Representation Learning via Image Captioning

Tom Sander; Yaodong Yu; Maziar Sanjabi; Alain Durmus; Yi Ma; Kamalika; Chaudhuri; Chuan Guo

arXiv:2403.02506·cs.CV·October 31, 2024·3 cites

Differentially Private Representation Learning via Image Captioning

Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika, Chaudhuri, Chuan Guo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for differentially private representation learning using image captioning, achieving high-quality image features on large-scale datasets and significantly improving privacy-accuracy trade-offs in vision tasks.

Contribution

The authors develop a scalable DP image captioning model trained on large multimodal datasets, enabling high-quality private image representations for downstream tasks.

Findings

01

DP-Cap achieves 65.8% accuracy on ImageNet-1K under privacy budget ε=8.

02

Significant improvement over previous state-of-the-art (56.5%).

03

Effective training on 233M images from LAION-2B dataset.

Abstract

Differentially private (DP) machine learning is considered the gold-standard solution for training a model from sensitive data while still preserving privacy. However, a major barrier to achieving this ideal is its sub-optimal privacy-accuracy trade-off, which is particularly visible in DP representation learning. Specifically, it has been shown that under modest privacy budgets, most models learn representations that are not significantly better than hand-crafted features. In this work, we show that effective DP representation learning can be done via image captioning and scaling up to internet-scale multimodal datasets. Through a series of engineering tricks, we successfully train a DP image captioner (DP-Cap) on a 233M subset of LAION-2B from scratch using a reasonable amount of computation, and obtaining unprecedented high-quality image features that can be used in a variety of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/dpcap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI