Exact and Efficient Bayesian Inference for Privacy Risk Quantification   (Extended Version)

Rasmus C. R{\o}nneberg; Ra\'ul Pardo; Andrzej W\k{a}sowski

arXiv:2308.16700·cs.CR·September 1, 2023

Exact and Efficient Bayesian Inference for Privacy Risk Quantification (Extended Version)

Rasmus C. R{\o}nneberg, Ra\'ul Pardo, Andrzej W\k{a}sowski

PDF

Open Access

TL;DR

This paper introduces an exact Bayesian inference engine using multivariate Gaussian distributions to precisely and efficiently quantify privacy risks in data analysis programs, improving over approximate methods like MCMC.

Contribution

It presents a novel exact inference approach for privacy risk analysis based on Gaussian models, implemented for Python programs, and demonstrates its effectiveness and efficiency.

Findings

01

Accurately quantifies privacy risks in data analysis programs.

02

Outperforms existing approximate inference methods.

03

Effectively analyzes the impact of differential privacy.

Abstract

Data analysis has high value both for commercial and research purposes. However, disclosing analysis results may pose severe privacy risk to individuals. Privug is a method to quantify privacy risks of data analytics programs by analyzing their source code. The method uses probability distributions to model attacker knowledge and Bayesian inference to update said knowledge based on observable outputs. Currently, Privug uses Markov Chain Monte Carlo (MCMC) to perform inference, which is a flexible but approximate solution. This paper presents an exact Bayesian inference engine based on multivariate Gaussian distributions to accurately and efficiently quantify privacy risks. The inference engine is implemented for a subset of Python programs that can be modeled as multivariate Gaussian models. We evaluate the method by analyzing privacy risks in programs to release public statistics. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Software Reliability and Analysis Research