Private Distribution Learning with Public Data: The View from Sample Compression
Shai Ben-David, Alex Bie, Cl\'ement L. Canonne, Gautam Kamath, Vikrant, Singhal

TL;DR
This paper explores private distribution learning using both public and private data, establishing a connection with sample compression schemes and providing new bounds and results for Gaussian mixtures and other distribution classes.
Contribution
It introduces a novel link between public-private learnability and sample compression, leading to new bounds and results for Gaussian mixtures and distribution classes.
Findings
Public-private learnability is connected to sample compression schemes.
New sample complexity bounds for Gaussian mixtures.
At least d public samples are necessary for Gaussian private learning.
Abstract
We study the problem of private distribution learning with access to public data. In this setup, which we refer to as public-private learning, the learner is given public and private samples drawn from an unknown distribution belonging to a class , with the goal of outputting an estimate of while adhering to privacy constraints (here, pure differential privacy) only with respect to the private samples. We show that the public-private learnability of a class is connected to the existence of a sample compression scheme for , as well as to an intermediate notion we refer to as list learning. Leveraging this connection: (1) approximately recovers previous results on Gaussians over ; and (2) leads to new ones, including sample complexity upper bounds for arbitrary -mixtures of Gaussians over , results for agnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Machine Learning and Algorithms
