Generative Trees: Adversarial and Copycat
Richard Nock, Mathieu Guillame-Bert

TL;DR
This paper introduces generative trees (GTs), a novel tree-based generative model for tabular data that leverages classical decision tree principles and adversarial training, offering interpretable alternatives to neural network methods.
Contribution
It proposes a new generative model for tabular data based on decision trees, with a variational GAN-style loss and a boosting-compatible adversarial training algorithm.
Findings
GTs perform well on fake/real distinction tasks.
GTs effectively handle training from fake data.
GTs provide interpretable solutions for missing data imputation.
Abstract
While Generative Adversarial Networks (GANs) achieve spectacular results on unstructured data like images, there is still a gap on tabular data, data for which state of the art supervised learning still favours to a large extent decision tree (DT)-based models. This paper proposes a new path forward for the generation of tabular data, exploiting decades-old understanding of the supervised task's best components for DT induction, from losses (properness), models (tree-based) to algorithms (boosting). The \textit{properness} condition on the supervised loss -- which postulates the optimality of Bayes rule -- leads us to a variational GAN-style loss formulation which is \textit{tight} when discriminators meet a calibration property trivially satisfied by DTs, and, under common assumptions about the supervised loss, yields "one loss to train against them all" for the generator: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsGoal-Driven Tree-Structured Neural Model
