Data Synthesis based on Generative Adversarial Networks
Noseong Park, Mahmoud Mohammadi, Kshitij Gorde, Sushil Jajodia,, Hongkyu Park, Youngmin Kim

TL;DR
This paper introduces table-GAN, a generative adversarial network-based method for synthesizing data that preserves utility for machine learning models while enhancing privacy, outperforming existing techniques in balancing privacy and utility.
Contribution
The paper presents a novel GAN-based data synthesis method that maintains data utility for models and ensures privacy, addressing limitations of traditional anonymization techniques.
Findings
Table-GAN achieves a better privacy-utility balance than existing methods.
Models trained on synthetic data from table-GAN perform similarly to those trained on original data.
Table-GAN consistently outperforms state-of-the-art anonymization and perturbation techniques.
Abstract
Privacy is an important concern for our society where sharing data with partners or releasing data to the public is a frequent occurrence. Some of the techniques that are being used to achieve privacy are to remove identifiers, alter quasi-identifiers, and perturb values. Unfortunately, these approaches suffer from two limitations. First, it has been shown that private information can still be leaked if attackers possess some background knowledge or other information sources. Second, they do not take into account the adverse impact these methods will have on the utility of the released data. In this paper, we propose a method that meets both requirements. Our method, called table-GAN, uses generative adversarial networks (GANs) to synthesize fake tables that are statistically similar to the original table yet do not incur information leakage. We show that the machine learning models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
