DeSIA: Attribute Inference Attacks Against Limited Fixed Aggregate Statistics
Yifeng Mao, Bozhidar Stevanoski, Yves-Alexandre de Montjoye

TL;DR
This paper introduces DeSIA, a novel attribute inference attack targeting limited fixed aggregate statistics, demonstrating significant privacy risks even with minimal data release and highlighting the need for formal privacy protections.
Contribution
DeSIA is a new inference attack framework that effectively infers user attributes from limited aggregate data, outperforming existing reconstruction methods and adaptable to membership inference.
Findings
DeSIA achieves a true positive rate of 0.14 at a false positive rate of 0.001.
Aggregation alone does not sufficiently protect privacy with few released statistics.
DeSIA performs well against unverifiable attributes and varying noise levels.
Abstract
Empirical inference attacks are a popular approach for evaluating the privacy risk of data release mechanisms in practice. While an active attack literature exists to evaluate machine learning models or synthetic data release, we currently lack comparable methods for fixed aggregate statistics, in particular when only a limited number of statistics are released. We here propose an inference attack framework against fixed aggregate statistics and an attribute inference attack called DeSIA. We instantiate DeSIA against the U.S. Census PPMF dataset and show it to strongly outperform reconstruction-based attacks. In particular, we show DeSIA to be highly effective at identifying vulnerable users, achieving a true positive rate of 0.14 at a false positive rate of . We then show DeSIA to perform well against users whose attributes cannot be verified and when varying the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Data Quality and Management
