Database Matching Under Adversarial Column Deletions
Serhat Bakirtas, Elza Erkip

TL;DR
This paper studies how to match anonymized databases when an adversary can delete a fraction of columns to hinder matching, revealing the limits of privacy-preserving data publication under adversarial attacks.
Contribution
It introduces a two-phase scheme for adversarial database matching, providing necessary and sufficient conditions, and characterizes the adversarial matching capacity.
Findings
Adversarial column deletions significantly lower matching capacity than random deletions.
A two-phase detection and matching scheme achieves near-perfect recovery under certain conditions.
Adversarial mechanisms offer stronger privacy protection compared to random distortions.
Abstract
The de-anonymization of users from anonymized microdata through matching or aligning with publicly-available correlated databases has been of scientific interest recently. While most of the rigorous analyses of database matching have focused on random-distortion models, the adversarial-distortion models have been wanting in the relevant literature. In this work, motivated by synchronization errors in the sampling of time-indexed microdata, matching (alignment) of random databases under adversarial column deletions is investigated. It is assumed that a constrained adversary, which observes the anonymized database, can delete up to a fraction of the columns (attributes) to hinder matching and preserve privacy. Column histograms of the two databases are utilized as permutation-invariant features to detect the column deletion pattern chosen by the adversary. The detection of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Cryptography and Data Security
