Bayesian Estimation of Attribute Disclosure Risks in Synthetic Data with the $\texttt{AttributeRiskCalculation}$ R Package
Ryan Hornby, Jingchen Hu

TL;DR
This paper introduces a Bayesian approach and an R package for evaluating attribute disclosure risks in synthetic data, aiding privacy protection assessments before data release.
Contribution
It presents a detailed review of Bayesian methods for attribute disclosure risk estimation and provides the ttributeRiskCalculation R package for practical implementation.
Findings
The package effectively evaluates disclosure risks in synthetic datasets.
Bayesian methods offer a detailed assessment of attribute disclosure risks.
Application to Consumer Expenditure Surveys demonstrates practical utility.
Abstract
Synthetic data is a promising approach to privacy protection in many contexts. A Bayesian synthesis model, also known as a synthesizer, simulates synthetic values of sensitive variables from their posterior predictive distributions. The resulting synthetic data can then be released in place of the confidential data. An important evaluation prior to synthetic data release is its level of privacy protection, which is often in the form of disclosure risks evaluation. Attribute disclosure, referring to an intruder correctly inferring the confidential values of synthetic records, is one type of disclosure that is challenging to be computationally evaluated. In this paper, we review and discuss in detail some Bayesian estimation approaches to attribute disclosure risks evaluation, with examples of commonly-used Bayesian synthesizers. We create the R package…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Statistical Methods and Bayesian Inference · Probability and Risk Models
