Typical Stability

Raef Bassily; Yoav Freund

arXiv:1604.03336·cs.LG·September 20, 2016

Typical Stability

Raef Bassily, Yoav Freund

PDF

Open Access

TL;DR

This paper introduces typical stability, a new notion of algorithmic stability that ensures generalization in adaptive data analysis without requiring bounded sensitivity or independent samples, applicable to a wide range of queries.

Contribution

The paper defines typical stability, analyzes its implications for generalization error, and provides composition theorems and noise-addition algorithms tailored to this new stability concept.

Findings

01

Typical stability controls generalization error in dependent, non-bounded sensitivity queries.

02

It applies to subgaussian and subexponential queries with light-tailed distributions.

03

The paper establishes composition guarantees for multiple adaptive queries.

Abstract

In this paper, we introduce a notion of algorithmic stability called typical stability. When our goal is to release real-valued queries (statistics) computed over a dataset, this notion does not require the queries to be of bounded sensitivity -- a condition that is generally assumed under differential privacy [DMNS06, Dwork06] when used as a notion of algorithmic stability [DFHPRR15a, DFHPRR15b, BNSSSU16] -- nor does it require the samples in the dataset to be independent -- a condition that is usually assumed when generalization-error guarantees are sought. Instead, typical stability requires the output of the query, when computed on a dataset drawn from the underlying distribution, to be concentrated around its expected value with respect to that distribution. We discuss the implications of typical stability on the generalization error (i.e., the difference between the value of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Machine Learning and Algorithms