Beyond One-Size-Fits-All: Neural Networks for Differentially Private Tabular Data Synthesis
Kai Chen, Chen Gong, Tianhao Wang

TL;DR
This paper introduces MargNet, a neural network-based method for differentially private tabular data synthesis that excels in complex, densely correlated datasets by combining statistical model strategies with neural networks, achieving high utility and speed.
Contribution
MargNet innovatively integrates statistical model strategies into neural networks for DP tabular data synthesis, especially effective for densely correlated datasets.
Findings
Achieves utility close to the best statistical methods on sparse data.
Provides a 7× speedup over traditional statistical models.
Reduces fidelity error by up to 26% on dense datasets.
Abstract
In differentially private (DP) tabular data synthesis, the consensus is that statistical models are better than neural network (NN)-based methods. However, we argue that this conclusion is incomplete and overlooks the challenge of densely correlated datasets, where intricate dependencies can overwhelm statistical models. In such complex scenarios, neural networks are more suitable due to their capacity to fit complex distributions by learning directly from samples. Despite this potential, existing NN-based algorithms still suffer from significant limitations. We therefore propose MargNet, incorporating successful algorithmic designs of statistical models into neural networks. MargNet applies an adaptive marginal selection strategy and trains the neural networks to generate data that conforms to the selected marginals. On sparsely correlated datasets, our approach achieves utility close…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
