DeSIA: Attribute Inference Attacks Against Limited Fixed Aggregate   Statistics

Yifeng Mao; Bozhidar Stevanoski; Yves-Alexandre de Montjoye

arXiv:2504.18497·cs.CR·April 28, 2025·2 cites

DeSIA: Attribute Inference Attacks Against Limited Fixed Aggregate Statistics

Yifeng Mao, Bozhidar Stevanoski, Yves-Alexandre de Montjoye

PDF

Open Access

TL;DR

This paper introduces DeSIA, a novel attribute inference attack targeting limited fixed aggregate statistics, demonstrating significant privacy risks even with minimal data release and highlighting the need for formal privacy protections.

Contribution

DeSIA is a new inference attack framework that effectively infers user attributes from limited aggregate data, outperforming existing reconstruction methods and adaptable to membership inference.

Findings

01

DeSIA achieves a true positive rate of 0.14 at a false positive rate of 0.001.

02

Aggregation alone does not sufficiently protect privacy with few released statistics.

03

DeSIA performs well against unverifiable attributes and varying noise levels.

Abstract

Empirical inference attacks are a popular approach for evaluating the privacy risk of data release mechanisms in practice. While an active attack literature exists to evaluate machine learning models or synthetic data release, we currently lack comparable methods for fixed aggregate statistics, in particular when only a limited number of statistics are released. We here propose an inference attack framework against fixed aggregate statistics and an attribute inference attack called DeSIA. We instantiate DeSIA against the U.S. Census PPMF dataset and show it to strongly outperform reconstruction-based attacks. In particular, we show DeSIA to be highly effective at identifying vulnerable users, achieving a true positive rate of 0.14 at a false positive rate of $1 0^{- 3}$ . We then show DeSIA to perform well against users whose attributes cannot be verified and when varying the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Data Quality and Management