Characterizing the Sample Complexity of Private Learners
Amos Beimel, Kobbi Nissim, Uri Stemmer

TL;DR
This paper provides a combinatorial characterization of the sample complexity required for private learning, introducing the notion of probabilistic representation and the RepDim measure, analogous to VC dimension in non-private learning.
Contribution
It introduces the concept of probabilistic representation and RepDim, establishing a tight characterization of the sample complexity for private learning.
Findings
RepDim is both necessary and sufficient for private learning sample complexity
The characterization parallels VC dimension for non-private learning
Applicable to private data release and optimization problems
Abstract
In 2008, Kasiviswanathan et al. defined private learning as a combination of PAC learning and differential privacy. Informally, a private learner is applied to a collection of labeled individual information and outputs a hypothesis while preserving the privacy of each individual. Kasiviswanathan et al. gave a generic construction of private learners for (finite) concept classes, with sample complexity logarithmic in the size of the concept class. This sample complexity is higher than what is needed for non-private learners, hence leaving open the possibility that the sample complexity of private learning may be sometimes significantly higher than that of non-private learning. We give a combinatorial characterization of the sample size sufficient and necessary to privately learn a class of concepts. This characterization is analogous to the well known characterization of the sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Machine Learning and Algorithms
