Average-Case Averages: Private Algorithms for Smooth Sensitivity and Mean Estimation
Mark Bun, Thomas Steinke

TL;DR
This paper introduces instance-dependent noise scaling methods for differential privacy, improving accuracy in mean estimation by leveraging smooth sensitivity and proposing new noise distributions, especially for the trimmed mean estimator.
Contribution
It develops new methods for private mean estimation using smooth sensitivity and introduces three novel noise distributions for better accuracy.
Findings
Trimmed mean reduces sensitivity and improves privacy-accuracy trade-off.
New noise distributions outperform existing methods in experiments.
Methods are effective under average-case distributional assumptions.
Abstract
The simplest and most widely applied method for guaranteeing differential privacy is to add instance-independent noise to a statistic of interest that is scaled to its global sensitivity. However, global sensitivity is a worst-case notion that is often too conservative for realized dataset instances. We provide methods for scaling noise in an instance-dependent way and demonstrate that they provide greater accuracy under average-case distributional assumptions. Specifically, we consider the basic problem of privately estimating the mean of a real distribution from i.i.d.~samples. The standard empirical mean estimator can have arbitrarily-high global sensitivity. We propose the trimmed mean estimator, which interpolates between the mean and the median, as a way of attaining much lower sensitivity on average while losing very little in terms of statistical accuracy. To privately…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning
