TL;DR
This paper introduces a novel iterative algorithm for differentially private PCA that effectively estimates the top k eigenvectors with near-optimal utility, overcoming previous limitations related to sample size and noise.
Contribution
It presents the first algorithm capable of estimating the top k eigenvectors under differential privacy, with improved sample complexity and noise handling for arbitrary k.
Findings
Achieves near-optimal statistical error for k=1.
Provides a lower bound for k>1 matching the upper bound up to a factor of k.
Demonstrates experimental advantages over baseline methods.
Abstract
Given i.i.d. random matrices that share a common expectation , the objective of Differentially Private Stochastic PCA is to identify a subspace of dimension that captures the largest variance directions of , while preserving differential privacy (DP) of each individual . Existing methods either (i) require the sample size to scale super-linearly with dimension , even under Gaussian assumptions on the , or (ii) introduce excessive noise for DP even when the intrinsic randomness within is small. Liu et al. (2022a) addressed these issues for sub-Gaussian data but only for estimating the top eigenvector () using their algorithm DP-PCA. We propose the first algorithm capable of estimating the top eigenvectors for arbitrary , whilst overcoming both limitations above. For our algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
