Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation
Han Huang, Leilei Sun, Bowen Du, Weifeng Lv

TL;DR
This paper introduces JODO, a joint 2D and 3D diffusion model for complete molecule generation, improving the quality and versatility of generated molecules by modeling both molecular graphs and geometries simultaneously.
Contribution
The paper proposes a novel Diffusion Graph Transformer that jointly models 2D and 3D molecular data, enabling more accurate and versatile molecule generation.
Findings
JODO outperforms baselines on QM9 and GEOM-Drugs datasets.
The model achieves fast few-step sampling.
It effectively supports inverse molecular design.
Abstract
Designing new molecules is essential for drug discovery and material science. Recently, deep generative models that aim to model molecule distribution have made promising progress in narrowing down the chemical research space and generating high-fidelity molecules. However, current generative models only focus on modeling either 2D bonding graphs or 3D geometries, which are two complementary descriptors for molecules. The lack of ability to jointly model both limits the improvement of generation quality and further downstream applications. In this paper, we propose a new joint 2D and 3D diffusion model (JODO) that generates complete molecules with atom types, formal charges, bond information, and 3D coordinates. To capture the correlation between molecular graphs and geometries in the diffusion process, we develop a Diffusion Graph Transformer to parameterize the data prediction model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Laplacian EigenMap · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Diffusion
