DiGress: Discrete Denoising diffusion for graph generation
Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, Volkan, Cevher, Pascal Frossard

TL;DR
DiGress introduces a novel discrete diffusion model for graph generation that improves sample quality and scalability, achieving state-of-the-art results on molecular datasets and enabling large-scale drug molecule generation.
Contribution
It presents the first scalable discrete diffusion model for graph generation that incorporates a Markovian noise process and auxiliary features, advancing graph generative modeling.
Findings
Achieves up to 3x validity improvement on planar graphs.
Scales to 1.3 million drug-like molecules without specialized representations.
Sets new state-of-the-art performance on molecular graph datasets.
Abstract
This work introduces DiGress, a discrete denoising diffusion model for generating graphs with categorical node and edge attributes. Our model utilizes a discrete diffusion process that progressively edits graphs with noise, through the process of adding or removing edges and changing the categories. A graph transformer network is trained to revert this process, simplifying the problem of distribution learning over graphs into a sequence of node and edge classification tasks. We further improve sample quality by introducing a Markovian noise model that preserves the marginal distribution of node and edge types during diffusion, and by incorporating auxiliary graph-theoretic features. A procedure for conditioning the generation on graph-level features is also proposed. DiGress achieves state-of-the-art performance on molecular and non-molecular datasets, with up to 3x validity improvement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Computational Drug Discovery Methods · Machine Learning in Materials Science
MethodsAttention Is All You Need · Linear Layer · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Absolute Position Encodings · Softmax · Residual Connection · Diffusion
