TL;DR
This paper introduces GRDM, a novel graph neural network-based diffusion model that jointly generates entire relational databases without assuming table order, improving multi-table dependency modeling.
Contribution
The paper presents a new graph-conditional diffusion approach for joint RDB generation that overcomes limitations of sequential models and captures complex inter-table relations.
Findings
Outperforms autoregressive baselines on multi-hop inter-table correlations
Achieves state-of-the-art results on single-table fidelity metrics
Demonstrates effectiveness on six real-world RDBs
Abstract
Building generative models for relational databases (RDBs) is important for many applications, such as privacy-preserving data release and augmenting real datasets. However, most prior works either focus on single-table generation or adapt single-table models to the multi-table setting by relying on autoregressive factorizations and sequential generation. These approaches limit parallelism, restrict flexibility in downstream applications, and compound errors due to commonly made conditional independence assumptions. In this paper, we propose a fundamentally different approach: jointly modeling all tables in an RDB without imposing any table order. By using a natural graph representation of RDBs, we propose the Graph-Conditional Relational Diffusion Model (GRDM), which leverages a graph neural network to jointly denoise row attributes and capture complex inter-table dependencies.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
