Differentially Private Synthetic Data Using KD-Trees
Eleonora Krea\v{c}i\'c, Navid Nouri, Vamsi K. Potluru, Tucker Balch,, Manuela Veloso

TL;DR
This paper introduces space partitioning methods combined with noise perturbation to generate differentially private synthetic data, offering transparent algorithms with improved utility and scalability over deep generative models.
Contribution
It presents novel data independent and data dependent algorithms using space partitioning for differentially private synthetic data generation, overcoming high-dimensional challenges.
Findings
Empirical utility improvements over prior methods
Theoretical analysis of utility-privacy trade-offs
Scalable algorithms effective in high dimensions
Abstract
Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering statistical queries in a differentially private manner. However, for synthetic data generation problem, recent research has been mainly focused on deep generative models. In contrast, we exploit space partitioning techniques together with noise perturbation and thus achieve intuitive and transparent algorithms. We propose both data independent and data dependent algorithms for -differentially private synthetic data generation whose kernel density resembles that of the real dataset. Additionally, we provide theoretical results on the utility-privacy trade-offs and show how our data dependent approach overcomes the curse of dimensionality and leads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Data Mining Algorithms and Applications
