Differential privacy and noisy confidentiality concepts for European population statistics
Fabian Bach

TL;DR
This paper reviews statistical disclosure control methods using noise, emphasizing differential privacy, and discusses how different noise distributions impact census data utility and privacy in European population statistics.
Contribution
It clarifies the distinction between risk measures, noise distributions, and output mechanisms, and analyzes their implications for privacy and utility in census data.
Findings
Bounded noise distributions preserve census features better.
Unbounded noise like Laplace can threaten key census features.
Differential privacy mechanisms are constrained in census scenarios.
Abstract
The paper aims to give an overview of various approaches to statistical disclosure control based on random noise that are currently being discussed for official population statistics and censuses. A particular focus is on a stringent delineation between different concepts influencing the discussion: we separate clearly between risk measures, noise distributions and output mechanisms - putting these concepts into scope and into relation with each other. After recapitulating differential privacy as a risk measure, the paper also remarks on utility and risk aspects of some specific output mechanisms and parameter setups, with special attention on static outputs that are rather typical in official population statistics. In particular, it is argued that unbounded noise distributions, such as plain Laplace, may jeopardise key unique census features without a clear need from a risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
