High-Dimensional Asymptotics of Differentially Private PCA
Youngjoo Yun, Rishabh Dudeja

TL;DR
This paper provides precise privacy loss characterizations for differentially private PCA in high dimensions, revealing the exact difficulty of detecting individual presence and improving understanding of privacy-utility trade-offs.
Contribution
It offers sharp, dataset-specific privacy bounds for DP PCA in high dimensions, using hypothesis testing and Le Cam's methods, advancing beyond loose upper bounds.
Findings
Detecting an individual in privatized PCA is as hard as Gaussian mean testing.
Sharp privacy bounds depend on spectral properties of the dataset.
High-dimensional analysis reveals exact privacy-utility trade-offs.
Abstract
In differential privacy, random noise is introduced to privatize summary statistics of a sensitive dataset before releasing them. The noise level determines the privacy loss, which quantifies how easily an adversary can detect a target individual's presence in the dataset using the published statistic. Most privacy analyses provide upper bounds on the privacy loss. Sometimes, these bounds offer weak privacy guarantees unless the noise level is so high that it overwhelms the meaningful signal. It is unclear whether such high noise levels are necessary or a limitation of loose and pessimistic privacy bounds. This paper explores whether it is possible to obtain sharp privacy characterizations that determine the exact privacy loss of a mechanism on a given dataset. We study this problem in the context of differentially private principal component analysis (PCA), where the goal is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Random Matrices and Applications
