TL;DR
This paper introduces methods for constructing valid differentially private confidence intervals for the median, emphasizing direct estimation of interval bounds and addressing multiple sources of uncertainty, with extensive simulations and real data application.
Contribution
It proposes novel strategies for differentially private median confidence intervals, focusing on direct interval estimation and combined uncertainty management, improving statistical inference under privacy constraints.
Findings
Direct estimation of interval bounds outperforms traditional methods.
Simultaneous handling of sampling and privacy-induced uncertainty yields better intervals.
Algorithms perform well across various parameters in simulations and real census data.
Abstract
Differential privacy is a restriction on data processing algorithms that provides strong confidentiality guarantees for individual records in the data. However, research on proper statistical inference, that is, research on properly quantifying the uncertainty of the (noisy) sample estimate regarding the true value in the population, is currently still limited. This paper proposes and evaluates several strategies to compute valid differentially private confidence intervals for the median. Instead of computing a differentially private point estimate and deriving its uncertainty, we directly estimate the interval bounds and discuss why this approach is superior if ensuring privacy is important. We also illustrate that addressing both sources of uncertainty--the error from sampling and the error from protecting the output--simultaneously should be preferred over simpler approaches that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
