Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis
Bahrul Ilmi Nasution, Mark Elliot, Richard Allmendinger

TL;DR
This paper introduces GACTGAN, a Bayesian GAN variant using Gaussian approximation to synthesize tabular data more effectively and efficiently than existing methods, with improved data quality and privacy features.
Contribution
GACTGAN integrates SWAG for Bayesian approximation in CTGAN, reducing computational costs while enhancing data synthesis quality for tabular data.
Findings
GACTGAN outperforms CTGAN in data quality and privacy preservation.
GACTGAN reduces computational overhead compared to MCMC-based Bayesian GANs.
GACTGAN better maintains tabular structure and statistical properties.
Abstract
Generative Adversarial Networks (GAN) have been used in many studies to synthesise mixed tabular data. Conditional tabular GAN (CTGAN) have been the most popular variant but struggle to effectively navigate the risk-utility trade-off. Bayesian GAN have received less attention for tabular data, but have been explored with unstructured data such as images and text. The most used technique employed in Bayesian GAN is Markov Chain Monte Carlo (MCMC), but it is computationally intensive, particularly in terms of weight storage. In this paper, we introduce Gaussian Approximation of CTGAN (GACTGAN), an integration of the Bayesian posterior approximation technique using Stochastic Weight Averaging-Gaussian (SWAG) within the CTGAN generator to synthesise tabular data, reducing computational overhead after the training phase. We demonstrate that GACTGAN yields better synthetic data compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
