Differentially Private Bootstrap: New Privacy Analysis and Inference Strategies
Zhanyu Wang, Guang Cheng, Jordan Awan

TL;DR
This paper introduces a new privacy analysis for the DP bootstrap, providing methods for private statistical inference, including confidence intervals, with proven validity and optimal convergence, applicable to various regression tasks.
Contribution
It presents a novel privacy cost analysis for DP bootstrap estimates, a numerical method for exact privacy accounting, and new private inference techniques for multiple statistical tasks.
Findings
Private CIs achieve nominal coverage levels.
The Gaussian-DP framework effectively bounds privacy loss for multiple estimates.
The methods are validated on census data and outperform existing approaches.
Abstract
Differentially private (DP) mechanisms protect individual-level information by introducing randomness into the statistical analysis procedure. Despite the availability of numerous DP tools, there remains a lack of general techniques for conducting statistical inference under DP. We examine a DP bootstrap procedure that releases multiple private bootstrap estimates to infer the sampling distribution and construct confidence intervals (CIs). Our privacy analysis presents new results on the privacy cost of a single DP bootstrap estimate, applicable to any DP mechanism, and identifies some misapplications of the bootstrap in the existing literature. For the composition of the DP bootstrap, we present a numerical method to compute the exact privacy cost of releasing multiple DP bootstrap estimates, and using the Gaussian-DP (GDP) framework (Dong et al., 2022), we show that the release of …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Privacy-Preserving Technologies in Data · Advanced Causal Inference Techniques
