Testing for large-dimensional covariance matrix under differential privacy
Shiwei Sang, Yicheng Zeng, Xuehu Zhu, Shurong Zheng

TL;DR
This paper introduces a differentially private test for large-dimensional covariance matrices that maintains statistical power and privacy guarantees, even in high-dimensional settings with unbounded data.
Contribution
It develops a privacy-preserving covariance test that is asymptotically distribution-free and effective in high-dimensional regimes without assuming bounded data.
Findings
Guarantees privacy with high probability for large samples
Test is asymptotically distribution-free with known critical values
Detects local alternatives at the optimal rate of 1/√n
Abstract
The increasing prevalence of high-dimensional data across various applications has raised significant privacy concerns in statistical inference. In this paper, we propose a differentially private integrated statistic for testing large-dimensional covariance structures, enabling accurate statistical insights while safeguarding privacy. First, we analyze the global sensitivity of sample eigenvalues for sub-Gaussian populations, where our method bypasses the commonly assumed boundedness of data covariates. For sufficiently large sample size, the privatized statistic guarantees privacy with high probability. Furthermore, when the ratio of dimension to sample size, , the privatized test is asymptotically distribution-free with well-known critical values, and detects the local alternative hypotheses distinct from the null at the fastest rate of .…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Random Matrices and Applications · Probability and Risk Models
