
TL;DR
This paper introduces typical stability, a new notion of algorithmic stability that ensures generalization in adaptive data analysis without requiring bounded sensitivity or independent samples, applicable to a wide range of queries.
Contribution
The paper defines typical stability, analyzes its implications for generalization error, and provides composition theorems and noise-addition algorithms tailored to this new stability concept.
Findings
Typical stability controls generalization error in dependent, non-bounded sensitivity queries.
It applies to subgaussian and subexponential queries with light-tailed distributions.
The paper establishes composition guarantees for multiple adaptive queries.
Abstract
In this paper, we introduce a notion of algorithmic stability called typical stability. When our goal is to release real-valued queries (statistics) computed over a dataset, this notion does not require the queries to be of bounded sensitivity -- a condition that is generally assumed under differential privacy [DMNS06, Dwork06] when used as a notion of algorithmic stability [DFHPRR15a, DFHPRR15b, BNSSSU16] -- nor does it require the samples in the dataset to be independent -- a condition that is usually assumed when generalization-error guarantees are sought. Instead, typical stability requires the output of the query, when computed on a dataset drawn from the underlying distribution, to be concentrated around its expected value with respect to that distribution. We discuss the implications of typical stability on the generalization error (i.e., the difference between the value of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Machine Learning and Algorithms
