Revisiting Differentially Private Hypothesis Tests for Categorical Data

Yue Wang; Jaewoo Lee; Daniel Kifer

arXiv:1511.03376·cs.CR·March 21, 2017·44 cites

TL;DR

This paper develops differentially private hypothesis tests for categorical data that adjust for the noise added for privacy, improving power and yielding reliable $p$-values across a variety of tests.

Contribution

It introduces practical differentially private likelihood ratio and chi-squared tests for categorical data, built on an asymptotic regime better suited to private testing and on a modified equivalence between chi-squared and likelihood ratio tests.

Findings

01 Enhanced test power under differential privacy

02 Reliable p-values with bias correction

03 Effective on diverse datasets and privacy levels

Abstract

In this paper, we consider methods for performing hypothesis tests on data protected by a statistical disclosure control technology known as differential privacy. Previous approaches to differentially private hypothesis testing either perturbed the test statistic with random noise having large variance (and resulted in a significant loss of power) or added smaller amounts of noise directly to the data but failed to adjust the test in response to the added noise (resulting in biased, unreliable $p$-values). In this paper, we develop a variety of practical hypothesis tests that address these problems. Using a different asymptotic regime that is more suited to hypothesis testing with privacy, we show a modified equivalence between chi-squared tests and likelihood ratio tests. We then develop differentially private likelihood ratio and chi-squared tests for a variety of applications on…
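
The second failure mode described in the abstract (noise added to the data, test left unadjusted) can be illustrated with a minimal sketch. This is not the paper's method: it only shows why a naive goodness-of-fit statistic computed on noisy counts is biased upward. The function names, the Laplace mechanism with sensitivity 2, and the treatment of the sample size `n` as public are all assumptions made for illustration.

```python
import random

def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)

def noisy_counts(counts, epsilon, rng, sensitivity=2.0):
    # Assumed mechanism: independent Laplace noise on each cell of the
    # histogram, with scale sensitivity / epsilon.
    scale = sensitivity / epsilon
    return [c + laplace_noise(scale, rng) for c in counts]

def chi_squared_stat(counts, probs, n):
    # Classical goodness-of-fit statistic; n is assumed to be public.
    return sum((c - n * p) ** 2 / (n * p) for c, p in zip(counts, probs))

def expected_inflation(n, probs, epsilon, sensitivity=2.0):
    # Laplace(0, b) has variance 2 * b**2, so each cell contributes
    # 2 * b**2 / (n * p) to the expected value of the naive statistic.
    scale = sensitivity / epsilon
    return sum(2.0 * scale ** 2 / (n * p) for p in probs)

# Hypothetical example: 4 equally likely categories, n = 1000, epsilon = 0.1.
rng = random.Random(0)
probs = [0.25] * 4
counts = [250, 250, 250, 250]          # counts that match the null exactly
private = noisy_counts(counts, 0.1, rng)
print("non-private statistic:", chi_squared_stat(counts, probs, 1000))
print("naive private statistic:", chi_squared_stat(private, probs, 1000))
print("expected inflation:", expected_inflation(1000, probs, 0.1))
```

In this hypothetical setting the expected inflation is about 12.8, while a chi-squared variable with 3 degrees of freedom has mean 3. Comparing the naive private statistic against the usual chi-squared reference distribution would therefore reject far too often, which is the kind of unreliable $p$-value the abstract attributes to unadjusted tests.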

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing