TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks
Amirarsalan Rajabi, Ozlem Ozmen Garibay

TL;DR
TabFairGAN introduces a two-phase Wasserstein GAN approach for generating fair synthetic tabular data, outperforming existing methods in accuracy and fairness while maintaining stability and avoiding common GAN issues.
Contribution
The paper presents a novel two-phase training process for GANs that incorporates fairness constraints, improving stability and performance in fair tabular data generation.
Findings
Outperforms state-of-the-art GANs in accurate data generation
Achieves promising fairness results on multiple datasets
Maintains stability with a single critic and avoids mode-dropping
Abstract
With the increasing reliance on automated decision making, the issue of algorithmic fairness has gained increasing importance. In this paper, we propose a Generative Adversarial Network for tabular data generation. The model includes two phases of training. In the first phase, the model is trained to accurately generate synthetic data similar to the reference dataset. In the second phase we modify the value function to add fairness constraint, and continue training the network to generate data that is both accurate and fair. We test our results in both cases of unconstrained, and constrained fair data generation. In the unconstrained case, i.e. when the model is only trained in the first phase and is only meant to generate accurate data following the same joint probability distribution of the real data, the results show that the model beats state-of-the-art GANs proposed in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
