Differentially Private Prototypes for Imbalanced Transfer Learning
Dariush Wahdany, Matthew Jagielski, Adam Dziedzic, Franziska Boenisch

TL;DR
This paper introduces Differentially Private Prototype Learning (DPPL), a novel method for private transfer learning that uses public pre-trained encoders to create private, high-utility class prototypes with strong privacy guarantees, especially effective in imbalanced data scenarios.
Contribution
DPPL offers a new paradigm for private transfer learning by generating DP prototypes from limited private data without iterative noise addition, improving privacy-utility trade-offs in imbalanced and low-data regimes.
Findings
DPPL achieves high utility predictions with strong privacy guarantees.
DPPL outperforms traditional DP-SGD approaches in imbalanced data settings.
Experimental results validate DPPL's effectiveness across multiple datasets and encoders.
Abstract
Machine learning (ML) models have been shown to leak private information from their training datasets. Differential Privacy (DP), typically implemented through the differential private stochastic gradient descent algorithm (DP-SGD), has become the standard solution to bound leakage from the models. Despite recent improvements, DP-SGD-based approaches for private learning still usually struggle in the high privacy ( and low data regimes, and when the private training datasets are imbalanced. To overcome these limitations, we propose Differentially Private Prototype Learning (DPPL) as a new paradigm for private transfer learning. DPPL leverages publicly pre-trained encoders to extract features from private data and generates DP prototypes that represent each private class in the embedding space and can be publicly released for inference. Since our DP prototypes can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
