Bayesian Advantage of Re-Identification Attack in the Shuffle Model
Pengcheng Su, Haibo Cheng, Ping Wang

TL;DR
This paper systematically analyzes the Bayesian advantage in re-identification attacks within the shuffle model, deriving exact formulas, asymptotic behaviors, and bounds, including the first upper bound in the context of shuffle differential privacy.
Contribution
It provides the first analytical and asymptotic characterization of Bayesian re-identification advantage in the shuffle model, along with bounds applicable to differential privacy settings.
Findings
Exact expressions for re-identification success probability.
Asymptotic analysis of Bayesian advantage.
Upper bound on re-identification probability in shuffle differential privacy.
Abstract
The shuffle model, which anonymizes data by randomly permuting user messages, has been widely adopted in both cryptography and differential privacy. In this work, we present the first systematic study of the Bayesian advantage in re-identifying a user's message under the shuffle model. We begin with a basic setting: one sample is drawn from a distribution , and samples are drawn from a distribution , after which all samples are randomly shuffled. We define as the success probability of a Bayes-optimal adversary in identifying the sample from , and define the additive and multiplicative Bayesian advantages as and , respectively. We derive exact analytical expressions and asymptotic characterizations of , along with evaluations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Smart Grid Security and Resilience
