Encode, Shuffle, Analyze Privacy Revisited: Formalizations and Empirical Evaluation
\'Ulfar Erlingsson, Vitaly Feldman, Ilya Mironov, Ananth Raghunathan,, Shuang Song, Kunal Talwar, Abhradeep Thakurta

TL;DR
This paper revisits the Encode-Shuffle-Analyze framework for privacy-preserving reporting, providing formal analysis, guidelines, and empirical evaluations demonstrating improved privacy-utility tradeoffs and applicability to real-world data and machine learning tasks.
Contribution
It offers a formal treatment of the ESA framework, introduces new privacy bounds, and empirically evaluates privacy-preserving techniques on real datasets and neural network training.
Findings
Fragmentation improves privacy and utility tradeoffs.
Formal bounds clarify limitations of sketch-based encodings.
Empirical results show effective privacy-preserving neural network training.
Abstract
Recently, a number of approaches and techniques have been introduced for reporting software statistics with strong privacy guarantees. These range from abstract algorithms to comprehensive systems with varying assumptions and built upon local differential privacy mechanisms and anonymity. Based on the Encode-Shuffle-Analyze (ESA) framework, notable results formally clarified large improvements in privacy guarantees without loss of utility by making reports anonymous. However, these results either comprise of systems with seemingly disparate mechanisms and attack models, or formal statements with little guidance to practitioners. Addressing this, we provide a formal treatment and offer prescriptive guidelines for privacy-preserving reporting with anonymity. We revisit the ESA framework with a simple, abstract model of attackers as well as assumptions covering it and other proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting
