Differentially private ratio statistics
Tomer Shoham, Katrina Ligettt

TL;DR
This paper develops methods for differentially private ratio statistics, such as relative risk and odds ratios, ensuring privacy while maintaining accuracy and providing confidence intervals, addressing a significant gap in privacy-preserving statistical analysis.
Contribution
It introduces new differentially private estimators for ratio statistics, analyzes their properties, and offers practical tools for private hypothesis testing and decision-making.
Findings
Simple algorithms achieve good privacy, accuracy, and bias at small sample sizes.
The private estimator for relative risk is consistent.
Method for constructing valid confidence intervals under differential privacy.
Abstract
Ratio statistics--such as relative risk and odds ratios--play a central role in hypothesis testing, model evaluation, and decision-making across many areas of machine learning, including causal inference and fairness analysis. However, despite privacy concerns surrounding many datasets and despite increasing adoption of differential privacy, differentially private ratio statistics have largely been neglected by the literature and have only recently received an initial treatment by Lin et al. [1]. This paper attempts to fill this lacuna, giving results that can guide practice in evaluating ratios when the results must be protected by differential privacy. In particular, we show that even a simple algorithm can provide excellent properties concerning privacy, sample accuracy, and bias, not just asymptotically but also at quite small sample sizes. Additionally, we analyze a differentially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Privacy-Preserving Technologies in Data · Healthcare Policy and Management
