Differentially Private Truncation of Unbounded Data via Public Second Moments
Zilong Cao, Xuan Bi, Hai Zhang

TL;DR
This paper introduces a method called PMT that uses public second-moment information to enable differential privacy on unbounded data, improving model accuracy and stability.
Contribution
The paper proposes a novel public-moment-guided truncation technique that enhances differential privacy by conditioning data transformations on public second moments.
Findings
PMT improves DP model accuracy on synthetic data.
PMT enhances stability and robustness of DP models.
Theoretical bounds confirm improved error rates.
Abstract
Data privacy is important in the AI era, and differential privacy (DP) is one of the golden solutions. However, DP is typically applicable only if data have a bounded underlying distribution. We address this limitation by leveraging second-moment information from a small amount of public data. We propose Public-moment-guided Truncation (PMT), which transforms private data using the public second-moment matrix and applies a principled truncation whose radius depends only on non-private quantities: data dimension and sample size. This transformation yields a well-conditioned second-moment matrix, enabling its inversion with a significantly strengthened ability to resist the DP noise. Furthermore, we demonstrate the applicability of PMT by using penalized and generalized linear regressions. Specifically, we design new loss functions and algorithms, ensuring that solutions in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Random Matrices and Applications · Stochastic Gradient Optimization Techniques
