VQ-SAD: Vector Quantized Structure Aware Diffusion For Molecule Generation

Farshad Noravesh; Reza Haffari; Layki Soon; Arghya Pal

arXiv:2605.00354·cs.LG·May 4, 2026

VQ-SAD: Vector Quantized Structure Aware Diffusion For Molecule Generation

Farshad Noravesh, Reza Haffari, Layki Soon, Arghya Pal

PDF

TL;DR

VQ-SAD introduces a neuro-symbolic diffusion model for molecule generation that leverages vector quantized atom and bond codes as latent variables, improving over existing methods on standard datasets.

Contribution

It proposes a novel VQ-VAE based framework that uses codebooks as tokenizers for diffusion, combining symbolic and neural information for better molecule generation.

Findings

01

VQ-SAD slightly outperforms state-of-the-art models on QM9 and ZINC250k datasets.

02

The large discrete code space improves the denoising process in diffusion models.

03

Using frozen pretrained VQ-VAE enhances molecule generation quality.

Abstract

Many diffusion based molecule generation methods ignore the symbolic information of molecules and represent the atom and bond type as one hot representation. Methods based on Morgan fingerprints produce hash collisions and are hard to embed into a continuous space without information loss and random fingerprints correspond to no valid molecule. To circumvent this issue we use another paradigm and consider atom and bond codes as latent variables of VQ-VAE. We introduce VQ-SAD which first trains a VQ-VAE and uses the frozen pretrained VQ-VAE model and considers the codebooks for both atom and bond types as tokenizers for the downstream diffusion process. VQ-SAD is a neuro-symbolic model that utilizes both symbolic and neural structural information for a diffusion based model with learnable forward process. The large discrete code space provides a more balanced atom and bond types which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.