Differentially Private Multivariate Statistics with an Application to Contingency Table Analysis
Minwoo Kim, Jonghyeok Lee, Seung Woo Kwak, Sungkyu Jung

TL;DR
This paper introduces improved Gaussian and Laplace mechanisms for differentially private multivariate statistics, especially contingency tables, demonstrating higher utility and power in hypothesis testing compared to existing methods.
Contribution
It develops rank-deficient James-Stein Gaussian mechanisms and minimal Laplace perturbation strategies under Gaussian differential privacy, enhancing utility in private multivariate data analysis.
Findings
Gaussian mechanisms outperform Laplace mechanisms at higher privacy levels.
Proposed mechanisms achieve higher statistical utility and test power.
Optimal Laplace calibration depends on more than global sensitivity.
Abstract
Differential privacy (DP) has become a rigorous central concept for privacy protection in the past decade. We use Gaussian differential privacy (GDP) in gauging the level of privacy protection for releasing statistical summaries from data. The GDP is a natural and easy-to-interpret differential privacy criterion based on the statistical hypothesis testing framework. The Gaussian mechanism is a natural and fundamental mechanism that can be used to perturb multivariate statistics to satisfy a -GDP criterion, where stands for the level of privacy protection. Requiring a certain level of differential privacy inevitably leads to a loss of statistical utility. We improve ordinary Gaussian mechanisms by developing rank-deficient James-Stein Gaussian mechanisms for releasing private multivariate statistics, and show that the proposed mechanisms have higher statistical utilities.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Random Matrices and Applications · Statistical Methods and Bayesian Inference
