A smoothing model for sample disclosure risk estimation
Yosef Rinott, Natalie Shlomo

TL;DR
This paper introduces a smoothing-based method for estimating sample disclosure risk by leveraging local neighborhood information of cells, improving risk assessment accuracy over existing methods.
Contribution
The paper proposes a novel smoothing model for population cell frequency estimation that incorporates local neighborhood information, enhancing disclosure risk estimation accuracy.
Findings
Preliminary results show the smoothing method performs well.
Comparisons indicate advantages over log-linear and Argus methods.
Method effectively estimates small cell frequencies for privacy risk assessment.
Abstract
When a sample frequency table is published, disclosure risk arises when some individuals can be identified on the basis of their values in certain attributes in the table called key variables, and then their values in other attributes may be inferred, and their privacy is violated. On the basis of the sample to be released, and possibly some partial knowledge of the whole population, an agency which considers releasing the sample, has to estimate the disclosure risk. Risk arises from non-empty sample cells which represent small population cells and from population uniques in particular. Therefore risk estimation requires assessing how many of the relevant population cells are likely to be small. Various methods have been proposed for this task, and we present a method in which estimation of a population cell frequency is based on smoothing using a local neighborhood of this cell, that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
