The Limits of Pan Privacy and Shuffle Privacy for Learning and Estimation
Albert Cheu, Jonathan Ullman

TL;DR
This paper demonstrates that for high-dimensional learning and estimation, the shuffle and pan-private models require exponentially more samples than the central differential privacy model, establishing fundamental limitations.
Contribution
It provides the first non-trivial lower bounds showing exponential sample complexity gaps for high-dimensional problems in shuffle and pan-private models.
Findings
Private agnostic learning of parity functions requires exponential samples.
Selecting the most common attribute needs exponentially more samples.
Lower bounds are established for pan-private and shuffle models.
Abstract
There has been a recent wave of interest in intermediate trust models for differential privacy that eliminate the need for a fully trusted central data collector, but overcome the limitations of local differential privacy. This interest has led to the introduction of the shuffle model (Cheu et al., EUROCRYPT 2019; Erlingsson et al., SODA 2019) and revisiting the pan-private model (Dwork et al., ITCS 2010). The message of this line of work is that, for a variety of low-dimensional problems -- such as counts, means, and histograms -- these intermediate models offer nearly as much power as central differential privacy. However, there has been considerably less success using these models for high-dimensional learning and estimation problems. In this work, we show that, for a variety of high-dimensional learning and estimation problems, both the shuffle model and the pan-private model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
