# Generating Poisson-Distributed Differentially Private Synthetic Data

**Authors:** Harrison Quick

arXiv: 1906.00455 · 2021-09-23

## TL;DR

This paper introduces a method for generating differentially private synthetic count data with Poisson distribution, improving utility especially for uneven subgroups, by bridging disease mapping techniques with formal privacy guarantees.

## Contribution

It extends existing privacy methods to Poisson-distributed data, incorporating prior information, and demonstrates improved utility over traditional techniques in simulated health data scenarios.

## Key findings

- Outperforms popular techniques in utility for uneven subgroups
- Effective for Poisson-distributed count data with prior information
- Demonstrated utility in simulated health data

## Abstract

The dissemination of synthetic data can be an effective means of making information from sensitive data publicly available while reducing the risk of disclosure associated with releasing the sensitive data directly. While mechanisms exist for synthesizing data that satisfy formal privacy guarantees, the utility of the synthetic data is often an afterthought. More recently, the use of methods from the disease mapping literature has been proposed to generate spatially-referenced synthetic data with high utility, albeit without formal privacy guarantees. The objective for this paper is to help bridge the gap between the disease mapping and the formal privacy literatures. In particular, we extend an existing approach for generating formally private synthetic data to the case of Poisson-distributed count data in a way that allows for the infusion of prior information. To evaluate the utility of the synthetic data, we conducted a simulation study inspired by publicly available, county-level heart disease-related death counts. The results of this study demonstrate that the proposed approach for generating differentially private synthetic data outperforms a popular technique when the counts correspond to events arising from subgroups with unequal population sizes or unequal event rates.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.00455/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1906.00455/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1906.00455/full.md

---
Source: https://tomesphere.com/paper/1906.00455