A systematic comparison of measures for publishing k-anonymous social network data
Rachel G. de Jong, Mark P. J. van der Loo, Frank W. Takes

TL;DR
This paper systematically compares different measures of k-anonymity in social networks, revealing how the choice of measure affects privacy assessment, utility, and computational costs, with empirical validation on large datasets.
Contribution
It provides a theoretical framework and empirical analysis comparing prominent k-anonymity measures, guiding better privacy-preserving social network data sharing.
Findings
The choice of anonymity measure significantly affects privacy and utility.
Measures considering larger node vicinities can be more effective with limited structural info.
Computational costs vary widely depending on the measure used.
Abstract
Sharing or publishing social network data while accounting for privacy of individuals is a difficult task due to the interconnectedness of nodes in networks. A key question in k-anonymity, a widely studied notion of privacy, is how to measure the anonymity of an individual, as this determines the attacker scenarios one protects against. In this paper, we systematically compare the most prominent anonymity measures from the literature in terms of the completeness and reach of the structural information they take into account. We present a theoretical characterization and a distance-parametrized strictness ordering of the existing measures for k-anonymity in networks. In addition, we conduct empirical experiments on a wide range of real-world network datasets with up to millions of edges. Our findings reveal that the choice of the measure significantly impacts the measured level of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data · Privacy, Security, and Data Protection
