Graph-Conditional Flow Matching for Relational Data Generation
Davide Scassola, Sebastiano Saccani, Luca Bortolussi

TL;DR
This paper introduces a graph-conditional flow matching model for relational data generation that captures complex dependencies and foreign-key relationships, improving the fidelity of synthetic relational datasets.
Contribution
It presents a novel deep generative model leveraging flow matching and graph neural networks to generate relational data with complex structures and long-range dependencies.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Supports complex relational structures with multiple foreign-key relationships.
Generates high-fidelity synthetic relational data.
Abstract
Data synthesis is gaining momentum as a privacy-enhancing technology. While single-table tabular data generation has seen considerable progress, current methods for multi-table data often lack the flexibility and expressiveness needed to capture complex relational structures. In particular, they struggle with long-range dependencies and complex foreign-key relationships, such as tables with multiple parent tables or multiple types of links between the same pair of tables. We propose a generative model for relational data that generates the content of a relational dataset given the graph formed by the foreign-key relationships. We do this by learning a deep generative model of the content of the whole relational database by flow matching, where the neural network trained to denoise records leverages a graph neural network to obtain information from connected records. Our method is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Stream Mining Techniques · Advanced Database Systems and Queries · Data Quality and Management
MethodsGraph Neural Network
