Latent Noise Injection for Private and Statistically Aligned Synthetic Data Generation
Rex Shen, Lu Tian

TL;DR
This paper introduces Latent Noise Injection with Masked Autoregressive Flows to generate privacy-preserving synthetic data that closely matches the true distribution, especially in high-dimensional settings, while enabling reliable statistical inference.
Contribution
It proposes a novel latent space perturbation method that improves convergence and privacy guarantees in synthetic data generation using flow models.
Findings
Achieves local differential privacy with a single perturbation parameter.
Restores classical efficiency through meta-analysis aggregation.
Demonstrates robustness against membership inference attacks.
Abstract
Synthetic Data Generation has become essential for scalable, privacy-preserving statistical analysis. While standard approaches based on generative models, such as Normalizing Flows, have been widely used, they often suffer from slow convergence in high-dimensional settings, frequently converging more slowly than the canonical rate when approximating the true data distribution. To overcome these limitations, we propose a Latent Noise Injection method using Masked Autoregressive Flows (MAF). Instead of directly sampling from the trained model, our method perturbs each data point in the latent space and maps it back to the data domain. This construction preserves a one to one correspondence between observed and synthetic data, enabling synthetic outputs that closely reflect the underlying distribution, particularly in challenging high-dimensional regimes where traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
MethodsNormalizing Flows
